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Quantum theory can be viewed as a generabzation of classical probability theory, but the anal- 
ogy as it has been developed so far is not complete. Classical probability theory is independent 
of causal structure, whereas the conventional quantum formalism requires causal structure to be 
fixed in advance. In this paper, we develop the formalism of quantum conditional states, which 
unifies the description of experiments involving two systems at a single time with the description of 
those involving a single system at two times. The analogies between quantum theory and classical 
probability theory are expressed succinctly within the formalism and it unifies the mathematical 
description of distinct concepts, such as ensemble preparation procedures, measurements, and quan- 
tum dynamics. We introduce a quantum generalization of Bayes' theorem and the associated notion 
of Bayesian conditioning. Conditioning a quantum state on a classical variable is the correct rule 
for updating quantum states in light of classical data, regardless of the causal relationship between 
the classical variable and the quantum degrees of freedom, but it does not include the projection 
postulate as a special case. We show that previous arguments that projection is the quantum gener- 
alization of conditioning are based on misleading analogies. Since our formalism is causally neutral, 
conditioning provides a unification of the predictive and retrodictive formalisms for prepare-and- 
measure experiments and leads to an elegant derivation of the set of states that a system can be 
"steered" to by making measurements on a remote system. 
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I. INTRODUCTION 

Quantum theory can be understood as a non- 
commutative generalization of classical probability the- 
ory wherein probability measures are replaced by density 
operators. Much of quantum information theory, espe- 
cially quantum Shannon theory, can be viewed as the 
systematic application of this generalization to classical 
information theory. 

However, despite the power of this point of view, the 
conventional formalism for quantum theory is a poor ana- 
log to classical probability theory because, in quantum 
theory, the appropriate mathematical description of an 
experiment depends on its causal structure. For exam- 
ple, experiments involving a pair of systems at space-like 
separation are described differently from those that in- 
volve a single system at two different times. The former 
are described by a joint state on the tensor product of 
two Hilbert spaces, and the latter by an input state and 
a dynamical map on a single Hilbert space. Classical 
probability works at a more abstract level than this. It 
specifies how to represent uncertainty prior to, and inde- 
pendently of, causal structure. For example, our uncer- 
tainty about two random variables is always described by 
a joint probability distribution, regardless of whether the 
variables represent two spacelike separated systems or the 
input and output of a classical channel. Although chan- 
nels represent time evolution, they are described math- 
ematically by conditional probability distributions. The 
input state specifies a marginal distribution, and thus we 
have the ingredients to define a joint probability distri- 
bution over the input and output variables. This joint 
probability distribution could equally well be used to de- 
scribe two spacelike separated variables. Therefore, we 
do not need to know how the variables are embedded in 
space-time in advance in order to apply classical prob- 
ability theory. This has the advantage that it cleanly 



separates the concept of correlation from that of causa- 
tion. The former is the proper subject of probabilistic 
inference and statistics. Within the subjective Bayesian 
approach to probability, independence of inference and 
causality has been emphasized by de Finetti ([l|. Preface 
pp. x-xi): 

Probabilistic reasoning — always to be un- 
derstood as subjective — merely stems from 
our being uncertain about something. It 
makes no difference whether the uncertainty 
relates to an unforeseeable future, or to an 
unnoticed past, or to a past doubtfully re- 
ported or forgotten; it may even relate to 
something more or less knowable (by means 
of a computation, a logical deduction, etc.) 
but for which we are not willing to make the 
effort; and so on. 

Thus, in order to build a quantum theory of Bayesian 
inference, we need a formalism that is even-handed in its 
treatment of different causal scenarios. There are some 
clues that this might be possible. Several authors have 
noted that that there are close connections, and often iso- 
morphisms, between the statistics that can be obtained 
from quantum experiments with distinct causal arrange- 
ments [2|-|8|. Time reversal symmetry is an example of 
this, but it is also possible to relate experiments involving 
two systems at the same time with those involving a sin- 
gle system at two times. The equivalence [3of prepare- 
and-measure [10| and entanglement-based |ll| quantum 
key distribution protocols is an example of this, and pro- 
vides the basis for proofs of the security of the former 
[l3 |. Such equivalences suggest that it may be possible 
to obtain a causally neutral formalism for quantum the- 
ory by describing such isomorphic experiments by similar 
mathematical objects. 

One of the main goals of this work is to provide this 
unification for the case of experiments involving two dis- 
tinct quantum systems at one time and those involving 
a single quantum system at two times, and to provide 
a framework for making probabilistic inferences that is 
independent of this causal structure. Both types of ex- 
periment can be described by operators on a tensor prod- 
uct of Hilbert spaces, differing from one another only by 
a partial transpose. Probabilistic inference is achieved 
using a quantum generalization of Bayesian condition- 
ing applied to quantum conditional states, which are the 
main objects of study of this work. 

Quantum conditional states are a generalization of 
classical conditional probability distributions. Condi- 
tional probability plays a key role in classical probabil- 
ity theory, not least due to its role in Bayesian infer- 
ence, and there have been attempts to generalize it to 
the quantum case. The most relevant to quantum infor- 
mation are perhaps the quantum conditional expectation 
|13| (see [Tj, |T^ for a basic introduction and [16] for a 
review) and the Cerf-Adami conditional density operator 



|T7hT9| . To date, these have not seen widespread apph- 
cation in quantum information, which casts some doubt 
on whether they are really the most useful generalization 
of conditional probability from the point of view of prac- 
tical applications. Quantum conditional states, which 
have previously appeared in [J, [20, UM,, provide an al- 
ternative approach to this problem. We show that they 
are useful for drawing out the analogies between classi- 
cal probability and quantum theory, they can be used to 
describe both space-like and time-like correlations, and 
they unify concepts that look distinct in the conventional 
formalism. For example, the descriptions of the prepara- 
tion of an ensemble of quantum states, the probabilities 
for the outcomes of measurements, and quantum dynam- 
ics can all be written as a generalization of the classical 
belief propagation rule (also called the law of total prob- 
ability) (TJ, which is given by 



P(5) = ^P(^|i?)P(i?). 



(1) 



The three cases differ only in the choice of which vari- 
ables; R, S or neither; remain classical in the generaliza- 
tion. We also show that the ensemble of states generated 
by the most general update rule for a quantum system 
after a measurement — a quantum instrument — can be 
described by belief propagation with respect to a quan- 
tum conditional state. 

We introduce a quantum version of Bayes' theorem, 
which generalizes a rule previously advocated by Fuchs 
[22| . Several well-known constructions in quantum infor- 
mation are instances of this theorem, including the cor- 
respondence between Positive Operator Valued Measures 
(POVMs) and ensemble decompositions of a density op- 
erator [3, 0, ^M 1 tbe "pretty good" measurement |23 - l26| , 
and the Barnum-Knill recovery map for approximate er- 
ror correction '27;]. 

Finally, we discuss conditioning a quantum state on 
a classical variable. This is the correct way to update 
quantum states in light of classical data, regardless of 
the causal relationship between the classical variable and 
the quantum degrees of freedom. The causal neutral- 
ity of our formalism unifies the treatment of predictions 
with that of retrodictions (inferences about the past), in 
analogy with the unification found in classical Bayesian 
inference. The retrodictive formalism we devise coincides 
with the one introduced in |28l - l3C| in the case of unbi- 
ased sources, but differs in the general case, retaining 
a closer analogy with classical Bayesian inference. The 
formalism also describes the case of conditioning on the 
outcome of a remote measurement, such as in the EPR 
experiment or more generally in "quantum steering" . Al- 
though our notion of conditioning does not include the 
projection postulate as a special case, we argue that it is 
nevertheless the correct way to update states in the light 
of measurement results, and that the assertion that the 
projection postulate is analogous to Bayesian condition- 
ing [3l|, IsJ is based on a misleading analogy. The latter 
is best described as the application of a belief propaga- 



tion rule (a non-selective update map), followed by con- 
ditioning (the selection). The remote measurement case 
provides an elegant derivation of the formula for the set 
of ensembles to which a remote system may be steered, 
previously obtained by conventional methods in ^] . 



A. Causal Neutrality 

Unifying the quantum description of experiments in- 
volving two distinct systems at one time with the de- 
scription of those involving a single system at two dis- 
tinct times requires some modifications to the way that 
the Hilbert space formalism of quantum theory is usually 
set up. Conventionally, a Hilbert space Ha describes a 
system, labeled A, that persists through time. Given two 
such systems, A and B, the joint system is described by 
the tensor product Hab = T^A ^ Hb- In the present 
work, a Hilbert space and its associated label should 
rather be thought of as representing a localized region of 
space-time. Specifically, an elementary region is a small 
space-time region in which an agent might possibly make 
a single intervention in the course of an experiment, for 
example by making a measurement or by preparing a 
specific state. Each elementary region is associated with 
a label. A, and a Hilbert space Ha- 

Generally, a region will refer to a collection of elemen- 
tary regions. A region that is composed of a pair of 
disjoint regions, labeled A and B, is ascribed the tensor 
product Hilbert space Hab = Ha €5 Hb- In contrast to 
the usual formalism, this applies regardless of whether A 
and B describe independent systems or the same system 
at two different times. Because of this, if an experiment 
involves a system that does persist through time, then 
a different label is given to each region it inhabits, e.g., 
the input and output spaces for a quantum channel are 
assigned different labels. 

As discussed in the introduction, we will make use of 
a conditional quantum state to achieve a unified descrip- 
tion of the spatial and temporal scenarios. In fact, al- 
though we motivate our work by the distinction between 
spatial and temporal separation, we find that it is not the 
spatio-temporal relation between the regions that is rel- 
evant for how they ought to be represented in our quan- 
tum generalization of probability theory. Rather, it is 
the causal relation that holds between them which is im- 
portant. 

More precisely, what is important is the distinction 
between two regions that are causally related, which is to 
say that one has a causal infiuence on the other (perhaps 
via intermediaries), and two regions that are acausally 
related, which is to say that neither has a causal influence 
on the other (although they may have a common cause 
or a common effect, or be connected via intermediaries 
to a common cause or a common effect). 

The causal relation between a pair of regions cannot 
be inferred simply from their spatio-temporal relation. 
Consider a relativistic quantum theory for instance. Al- 



though a pair of regions that are space-hke separated 
are always acausally related, a pair of regions that are 
time-like separated can be related causally, for instance 
if they constitute the input and the output of a chan- 
nel, or they can be related acausally, for instance if they 
constitute the input of one channel and the output of 
another. Although time- like separation implies that a 
causal connection is possible, it is whether such a con- 
nection actually holds that is relevant in our formalism. 
The distinction can also be made in nonrelativistic theo- 
ries, and in theories with exotic causal structure. Indeed, 
causal structure is a more primitive notion than spatio- 
temporal structure, and it is all that we need here. 

Typically, we shall confine our attention to two 
paradigmatic examples of causal and acausal separation 
(which can be formulated in either a relativistic or a non- 
relativistic quantum theory). Two distinct regions at 
the same time, the correlations between which are con- 
ventionally described by a bipartite quantum state, are 
acausally related. The regions at the input and output of 
a quantum channel, the correlations between which are 
conventionally described by an input state and a quan- 
tum channel, are causally related (although there are ex- 
ceptions, such as a channel which erases the state of the 
system and then re-prepares it in a fixed state) [75[- 

We unify the description of Bayesian inference in the 
two different causal scenarios in the sense that various 
formulas are shown to have precisely the same form, in 
particular, the relation between joints and conditionals, 
the formula for Bayesian inversion and the formula for 
belief propagation. 

Nonetheless, the different causal scenarios continue to 
be distinguished insofar as the set of operators that can 
represent a possible state in one scenario is different from 
the set that does so in the other scenario. This latter fact 
does not constitute a failure to achieve causal neutrality 
in the formalism for Bayesian inference because a similar 
phenomenon occurs classically. For instance, the causal 
relations among a triple of variables are significant for 
the sort of probability distribution that can be assigned 
to them. Specifically, if variable i? is a common cause of 
variables 5' and T, while there is no direct causal con- 
nection between S and T, then S and T should be con- 
ditionally independent given R, which is to say that the 
joint distribution over these variables is not arbitrary, but 
has the form P{R,S,T) = P{S\R)P{T\R)P{R). In our 
framework, the operator describing the state of some set 
of regions may also depend on the causal relations among 
those regions. 

There is, however, one sense in which the formalism for 
quantum Bayesian inference that we develop here is more 
sensitive to the causal structure than the formalism for 
classical Bayesian inference. In the latter, if we consider 
all the possible joint distributions over a pair of variables, 
R and S, we find that the set of possibilities is the same 
for the case where R and S are causally related as it is 
for the case where R and S are acausally related. 

To sum up, the fact that the set of possible states that 



can be assigned to a set of regions is constrained by the 
causal relation between those regions is common to the 
classical and quantum theories of inference. What is par- 
ticular to the theory of quantum inference is that even in 
the case of a pair of regions, the causal relation between 
the regions is relevant for the set of possible states that 
can be assigned to those regions [Tg. 



II. CLASSICAL CONDITIONAL PROBABILITY 

In this section, the basic definitions and formalism of 
classical conditional probability are reviewed, with a view 
to their quantum generalization in ijIIII 

Let R denote a (discrete) random variable, R = r the 
event that R takes the value r, P{R = r) the probability 
of event R — r, and P{R) the probability that R takes an 
arbitrary unspecified value. Finally, ^^ denotes a sum 
over the possible values of R. 

A conditional probability distribution is a function of 
two random variables P{S\R), such that for each value 
r of R, P{S\R = r) is a probability distribution over S. 
Equivalently, it is a positive function of R and S such 
that 



J2p{s\r) = i 



(2) 



independently of the value of R. 

Given a probability distribution P{R) and a condi- 
tional probability distribution P(S\R), a joint distribu- 
tion over R and S can be defined via 



P{R,S)^P{S\R)P{R), 



(3) 



where the multiplication is defined element-wise, i.e. for 
all values r,s oi R and S, P{R = r,S = s) = P{S = 
s\R = r)P{R^r). 

Conversely, given a joint distribution P{R,S), the 
marginal distribution over R is defined as 



P(i?) = ^P(i?,5), 



and the conditional probability of S given R is 

PiR,S) 



P{S\R) 



PiR) 



(4) 



(5) 



Note that eq. ([5]) only defines a conditional probability 
distribution for those values r oi R such that P{R ~ 
r) 7^ 0. The conditional probability is undefined for other 
values of R. 

The chain rule for conditional probabilities states 
that a joint probability over n random variables 
Ri,R2,. ■ ■ ,Rn can be written as 

P{Rl, R2, ■ ■ • , Rn) — P{Rn\Rl, R2, • ■ • , Rn-l) 

X P(i?„_i|i?i, i?2, . . . , Rn-2) ■ . . P(i?2|i?l)P(i?l). (6) 



Finally, note that the process of marginalizing a distri- 
bution over a set of variables commutes with the process 
of conditioning on a disjoint set of variables, as illustrated 
in the following commutative diagram. 



P{R,S,T) 

xP{T) 

P{R,S\T) 



Eh 



En 



> P{S,T) 

xP{T)' 

> P{S\T) 



(7) 



III. QUANTUM CONDITIONAL STATES 

In this section, the quantum analog of conditional 
probability — a conditional state — is introduced. We 
also discuss how the states assigned to disjoint regions are 
related via a quantum analog of the belief propagation 
rule P{S) = J2rP{S\R)P{R)- There is a smah differ- 
ence between conditional states for acausally related and 
causally related regions. The acausal case is discussed 
in flnSgnm ^IIICmilKI mainly concern the causal 
case, wherein we find that quantum dynamics, ensemble 
averaging, the Born rule, Heisenberg dynamics, and the 
transition from the initial state to the ensemble of states 
resulting from a measurement can all be represented as 
special cases of quantum belief propagation. However, 
acausal analogs of some of these ideas are also developed 
in these sections. 



A. Acausal Conditional States 

In this section, the quantum analog of a conditional 
probability distribution, a conditional state, is developed 
as it applies to acausally related regions. This causal sce- 
nario, and its classical analog, are depicted in fig. [TJ The 
definition proceeds in analogy with the classical treat- 
ment given in ijlll The convention of using A, _B, C, . . . 
to label quantum regions that are analogous to classical 
variables R,S,T,... is adopted throughout. The labels 
X,Y, Z, . . . are reserved for classical variables associated 
with preparations and measurements, which remain clas- 
sical when we pass from probability theory to the quan- 
tum analog. 

The analog of a probability distribution P{R) assigned 
to a random variable _R is a quantum state (density oper- 
ator) PA acting on a Hilbert space Ha- When there are 
two disjoint regions with Hilbert spaces Ha and Hb , the 
tensor product Hab — Ha ® Hb describes the compos- 
ite region. The quantum analog of a joint distribution 
P{R, S) is a density operator pab of the composite re- 
gion, defined on Hab- The analog of marginalization 
over a variable is the partial trace over a region. These 
analogies are set out in the top half of table HI 

In analogy to the classical case, where P{S\R) is a pos- 
itive function that satisfies J2s ^i'^l^) ~ ^^ ^^ acausal 
conditional state for B given A is defined as follows. 




FIG. 1: Acausally related quantum and classical regions. 
Classical variables are denoted by triangles and quantum re- 
gions by circles (this convention is suggested by the shape 
of the convex set of states in each theory). The dotted line 
represents acausal correlation, (a) Two quantum regions in 



an arbitrary joint state (possibly correlated), (b) Two classi- 
cal variables with an arbitrary joint probability distribution 
(possibly correlated). 



Classical Probability 


Quantum Theory 


PiR) 
P{R, S) 

p{s) = j:^p{r,s) 


PA 
pAB 

pB = TrA {pab) 


j:,p{s\R) = i 

P{R,S) ^PiS\R)P{R) 
P{S\R) = P{R,S)/P{R) 


Trs [pb\a) = Ia 

pab = pB\A * PA 
PB\A = PAB * Pa^ 



TABLE I: Analogies between classical probability theory for 
two random variables and quantum theory for two acausally 
related regions. 



Definition III.l. An acausal conditional state for B 
given yl is a positive operator pb\a on Hab = Ha ® Hb 
that satisfies 



Trs {pb\a) = Ia, 
where I a is the identity operator on Ha- 



(8) 



To provide an analogy with eq. ([3|) , a method of con- 
structing a joint state on Hab from a reduced state on 
Ha and a conditional state on Hab is required. This is 
given by 



PAB = (p1 «) Ib)pb\a{Pa ® ^b)- 



(9) 



Eq. (J9]) involves two constructions that appear repeat- 

edly in what follows. Firstly, the operators p\ and Pb\a 
are combined via multiplication, but they are defined on 

jL 

different spaces. To solve this problem, p\ is expanded to 
an operator on Hab by tensoring it with Ib - To simplify 
notation, the identity operators required to equalize the 
Hilbert spaces of two operators will be left implicit, so 



that if Mab is an operator on Hab and Nbc is an oper- 
ator on Hbc then MabNbc = {Mab <» Ic) {I a ® Nbc) 
and an equation hke Mab — Nbc is interpreted as 
Mab ® Ic ~ Ia ® Nbc- This notation ahows us to 
omit tensor product symbols where convenient, since 
Ma^Nb ^ {Ma (^ Ib) {Ia ® Nb) ^ MaNb- 

Secondly, rather than simply multiplying Pbia with 

PA in eq. ©, Pb\a is conjugated by p\. This ensures 
that the resulting joint operator is positive. To define a 
notation for this conjugation, let M and A'^ be positive 
operators on a Hilbert space H. Then define a (non- 
associative and non-commutative) product M -k N via 

M*N = nHiN^. (10) 

With these conventions, eq. © can be rewritten as 

Pas=Ps|a*Pa, (11) 

which looks a lot closer to eq. ([3]) than eq. © does. 

Starting with a joint state pab and its reduced state 
Pa = Tre {pab), a conditional state can be defined via 



Pbia 



Pab^Pa, 



(12) 



which is the analog of eq. ([S]) . 

As with eq. ^ there are problems with this formula 
if Pa is not supported on the entire Hilbert space T-La- 
In that case eq. ([T2|) is to be understood as an equation 
on the Hilbert space supp(pA) ® T-Lb, where supp(pyi) 
denotes the support of pA (the span of the eigenvec- 
tors of Pa having nonzero eigenvalues). Because of 
this, the resulting conditional density operator satisfies 
Trs {pb\a) = 4upp(pa) rather than eq. ^. 

The analogies between the classical and quantum rela- 
tions between conditionals, marginals and joints are set 
out in the bottom half of table ID As these analogies sug- 
gest, the •-product notation allows equations from clas- 
sical probability to be generalized to quantum theory by 
replacing functions by operators, products by •-products, 
and division by •-products with the inverse. However, 
whilst this is a useful way of postulating results in the 
conditional states formalism, one has to take care of the 
nonassociativity and noncommutativity of the •-product 
when making such generalizations. 

To provide an analogy with the chain rule of eq. (jS]) it 
is helpful to adopt the convention that, in the absence of 
parentheses, • products are evaluated right-to-left. Then, 
given n disjoint acausally related regions Ai,A2, . . . , An 
with Hilbert space ■HaiA2...a„ = ^^=i^Aj, the joint 
state can be written as 

PAiA2...A„ = PA„\AiA2...A„-i 

* PA„-i\AiA2...A„-2 * ■ ■ ■* PA2\Ai * PAi- (13) 

Finally, note that the process of marginalizing a con- 
ditional state over a region commutes with the process 



of conditioning on a disjoint region, as illustrated in the 
following commutative diagram: 



PABC 



Trc 



Pbc\a 



Trc 



> PAB 



(14) 



-^ Pb\a 



Example HI. 2 (Classical States). As one might expect, 
classical conditional probability is a special case of the 
quantum constructions outlined above. To see this, the 
classical variables have to be encoded in quantum regions 
in some way, and we adopt the convention of using the 
same letter to denote the classical variable and the cor- 
responding quantum region. Thus, Hb^Hs, "Ht, • . • refer 
to quantum regions that encode classical random vari- 
ables R, S,T, . . ., as opposed to Ha, 'Hb,'Hc, • ■ •, which 
are general quantum regions. 

For classical random variables R and 5, pick Hilbert 
spaces T-Lr and T-Ls with dimension equal to the number 
of distinct values of R and S respectively, and choose 
orthonormal bases {\r)} for "H^ and {|s)} for V-s labeled 
by the possible values of R and S. Then, joint, marginal 
and conditional probability distributions are encoded as 
operators via 

PBS - 5] F(i? = r, 5 = ,s) \r) (r|^ ® \s) {s\s (15) 
PR^Y. P(^ = '^) 1^) (^Ifl. = Trs {PRS) (16) 

r 

Using eqs. (I2][5]), it is straightforward to check that these 
operators satisfy eqs. ([8]), ([TT1) and ([T2|) . 

In order to unify the notation for classical vari- 
ables and quantum regions, the operators prs,Pr and 
PsiR are often used to directly represent the functions 
P{R, S), P{R) and P{S\R) without introducing the clas- 
sical functions explicitly. Whenever states and condi- 
tional states have subscripts R, S, T or X, Y, Z, they are 
implicitly assumed to be of this classical form. If needed, 
the classical functions can be read off from eqs. (|15m7p . 



Example HI. 3 (Pure Conditional States). A pure condi- 
tional state is one that is of the form pb\a = {ip) {^\b\a 



for some vector 



ib\a 



e 1-Lab- Since Tr^ {pb\a) = Ia, 



and I A has all eigenvalues equal to 1, the Schmidt de- 
composition of \ip) B\A ^^ '^^ the form 



lB\A 



E 
fc 



Uk)A® W) 



B ' 



(18) 



where {|wfe)} is an orthonormal basis for T-La and {|wfe)} 
is an orthonormal basis for T-Lb ■ This implies that a pure 
conditional state only exists if dim.{T-LA) < dim('Hs) be- 
cause otherwise there would not be enough orthonormal 
vectors on the B side to enforce Tr^ {pb\a) = Ia [77J . 



Since all the Schmidt coefficients are the same, the 
bases {|Mfc)} and {|ffc)} are highly non-unique. The con- 
ditional state |V')b|a itself only determines the relation- 
ship between the two Schmidt bases, i.e. for any basis in 
Ha it determines a corresponding basis in T-Ib- To see 
this, fix a reference basis, {|j)}, for Ha in order to define 
a complex conjugation operation. Next, define an isom- 

, where * denotes complex 



B \"-klA 



etry Ub\a = T,k \^k] 

conjugation in the {\j)} basis. Then, if {|wfc)^} is any 

other basis for Ha, eq. (HH) can be rewritten as 



'B\A 



J2\^k)A(^UB\A'\wl)A', 



(19) 



where A' labels a second copy of Ha- With respect to 
the reference basis {|j)}, this simplifies to 



/B\A 



U 



B\A' 



|$+) 



AA' 



(20) 



where \^+)aa' = Ej \n)AA'- 

Let Pa be an arbitrary density operator on Ha with 
eigendecomposition pA — J2kPk\'^k) {wk\A- Combining 
this with |f/)) (V'Ibia via eq. (|lip in order to define a joint 
state gives the projector onto the pure state 



1^) 



AB 



pIW 



B\A ■ 



(21) 



Combining this with eq. (|19p gives the Schmidt decom- 
position 



/AB 



Y^ VPk \wk)A «) Ub\A' \wI)a' 



(22) 



Since an arbitrary pure joint state is of this form, this 
shows that pure conditional states determine pure joint 
states when combined with arbitrary reduced states and, 
conversely, the conditional state of a pure joint state is 
always pure. 

Note that, using eq. (|20p instead of eq. (fT9)) gives 



ab = Pa Ubia'\<^^) 



AA' 



(23) 



which is a well known canonical decomposition of a bi- 
partite pure state. 



calculating the joint distribution P{R, S) ^ P{S\R)P{R) 
and then marginalizing over R. 

The belief propagation rule can be thought of as spec- 
ifying a linear map Tg^f/ from the space of probability 
distributions over R to the space of probability distribu- 
tions over S that preserves positivity and normalization. 
This is defined as 



rs\RiP{R))=Y.PiS\R)PiR). 



(25) 



Propagating beliefs about a quantum region A to an 
acausally related region B works in a similar way. If 
you specify a reduced state pA and a conditional state 
Pb\a then your state for B is determined by the acausal 
quantum belief propagation rule 



PB = Tr^ {pb\aPa) 



(26) 



which follows from the fact that the joint state is pab = 
Pb\a * PA, so that pb = Tr^i (yPB\A '*' Pa), and from the 
cyclic property of the trace. 

As in the classical case, acausal belief propagation can 
also be viewed as a linear map ^b\a from states on A to 
states on B that preserves positivity and normalization, 
defined by 



'^BiA (pa) = Tr^ {pb\aPa) 



(27) 



The linear map so defined is clearly positive because 
it maps states to states. It is not completely positive 
in general, but its composition with a transpose on A is 
completely positive. The map €b\a is in fact identical to 
the map associated to Pbia via the Jamiolkowski isomor- 
phism (34i] . which is a familiar construction in quantum 
information theory. These facts are consequences of the 
following theorem. 

Theorem III. 4 (Jamiolkowski Isomorphism). Let 
^B\A '■ ■C('Ha) ^ ^{Hb) be a linear map and let 
Mac € ^{Hac)j where He is a Hilhert space of arbi- 
trary dimension. Then, the action of ^b\a on £,{Ha) 
(tensored with the identity on £, (He)) is given by 

{<aB\A ® Ic) (Mac) = TrA {pb\aMac) , (28) 
where Pb\a G ■C(Hab) is given by 



B. Acausal Belief Propagation 

Suppose you characterize your beliefs about two clas- 
sical variables, R and S, by specifying a marginal prob- 
ability distribution P{R) and a conditional probability 
distribution P{S\R). Then, you can compute the proba- 
bility distribution you ought to assign to S via 



P(5)=^P(5|i?)P(i?). 



(24) 



This is called the classical belief propagation rule (also 
known as the law of total probability). It follows from 



Pb\a = (^bia'^Ia) J2 I-?') <^Ia ^ 1^) ^^l 



(29) 



j,k 



Here, A' labels a second copy of A, Ia is the identity 
superoperator on £,{Ha), o,nd {|j)} is an orthonormal 
basis for Ha ■ 

Furthermore, the operator Pb\a is an acausal condi- 
tional state, i.e. it satisfies definition \III.1[ if and only 
if iBbia ° Ta is CPT, where Ta : 2 (Ha) -^ 2,{Ha) de- 
notes the linear map implementing the partial transpose 
relative to some basis. 

The proof is provided in appendix [Xj 
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C. Causal Conditional States 

The analogy between conditional probabilities and 
conditional states presented so far is not complete. In 
conventional quantum theory, the tensor product Hab = 
Ha <8i Hb is used to represent a joint system with two 
subsystems, so that the conditional state Pb\a refers to 
the state of two subsystems at a given time. However, 
for classical conditional probabilities, there is no corre- 
sponding requirement that the two random variables R 
and S appearing in P(S\R) should have any particular 
causal relation to one another. Indeed, R might equally 
well represent the input to a classical channel and S the 
output, i.e. they may be causally related. This is illus- 
trated in fig. 2(b) If this is indeed the case, then the 
classical belief propagation rule of eqs. ([M|) and (^5]) can 
be interpreted as stochastic dynamics. 




pB = TrA {qb\aPa) 

Conventional notation: 
pB = £b\a{Pa) 



(a) 




P{S)^J:!,P{S\R)P{R) 



(b) 



FIG. 2: Causally related quantum and classical regions. The 
arrows represent the direction of causal influence. | (a) [ General 
quantum dynamics. A is the input to a CPT map and B is 
the output. |(b)| Classical stochastic dynamics. R is the input 
to a classical channel and S is the output. 

In order to formulate quantum theory as a causally 
neutral theory of Bayesian inference, the same formalism 
should be used to describe causally related regions as is 
used to describe acausally related regions. In particular, 
if A and B are two causally related regions, as depicted in 
fig. 2(a) then it ought to be possible to define a quantum 



conditional state for B given A as an operator on the 
tensor product Hab = Ha ® Hb- Towards this end, we 
make the following definition. 

Definition III. 5. A causal conditional state of B given 
A is an operator Qbia on Hab that can be written as 



Qb\a - Pb^a' 



(30) 



for some acausal conditional state Pb\a, where ^^ denotes 
the partial transpose in some basis on Ha- 

Thus, the set of causal conditional states is just the 
image under a partial transpose on the conditioning re- 
gion of the set of acausal conditional states. Note that, 
although the partial transpose is basis dependent, its im- 
age on the set of acausal conditional states is not and 
therefore neither is our definition of a causal conditional 
state. Also, because the set of acausal conditional states 
is mapped to itself by the full transpose (i.e. the transpose 
on AB), a partial transpose over the conditioned region 
B, rather than the conditioning region A, could alter- 
natively have been used to define a causal conditional 
state. 

Due to the partial transpose, causal conditional states 
are not positive operators in general, but they are always 
locally positive, i.e. (?A|^ (X) {<J)\b Qb\a Wa^ I'l') b — f^^' 

all|V')^€HA,|0)BGHB. 

In this section, we show how causal conditional states 
can be used for quantum belief propagation across 
causally related regions in the same way as acausal con- 
ditional states are used for acausally related regions, 
namely by a rule of the form pB — Tr^ {qb\aPa)- Belief 
propagation for dynamics with a quantum input and a 
quantum output are treated in ijIIIDI and i jIIIII treats be- 
lief propagation for causal conditional states themselves, 
which corresponds to composition of dynamical maps. 
ijIII El introduces the notion of a causal joint state, which 
is analogous to the joint distribution of input and output 
variables for a classical channel. ijIIIFI introduces the 
idea of a quantum-classical hybrid, which is a composite 
of a quantum region and a classical variable. This al- 
lows dynamics with a classical input and quantum output 
(and vice versa) to be described in terms of causal con- 
ditional states. These correspond to ensemble prepara- 
tion procedures and measurements, as discussed in JjIII Gl 
and JjIII HI In mil Jl the Heisenberg picture is translated 
into the conditional states formalism. In ^III Kl the most 
general type of state update rule that can occur after a 
measurement - a quantum instrument - is described in 
terms of causal conditional states. Table HI] summarizes 
the translation of these concepts from conventional no- 
tation to the conditional states formalism. 



D. Quantum Channels as Causal Belief 
Propagation 

Conventionally, the transition from a region A to a 
causally related later region B is described by a dynam- 
ical CPT map £b\a ■ S,{Ha) ^ £(Hb) such that, if 
PA is the state of A and pB is the state of B, then 
Pb — £b\a (pa)- However, causal conditional states can 
provide an alternative representation of quantum dynam- 
ics, as we will show. First note the following isomor- 
phism. 





Conventional Notation 


Conditional States Formalism 


Dynamics (CPT Map) 


£-B\A 

Pb = £b\a (pa) 


Qb\a 
PB = Tta (qbiaPa) 


Preparation (Set of States) 


P^ 

PA = T..P{X = x)pi 


Qa\x 
PA = Trx {qa\xPx) 


Measurement (POVM) 


P{Y = y) = TiA (E^Pa) 


Qy\a 
PY = Tta (qyiaPa) 


Composition of Dynamics 


£c\A ~ £c\B ° £b\A 


Qc\a = Trs {qc\bQb\a) 


Heisenberg picture (Dual Map) 


£■1" 
'^a\b 

^V = ^A\B i^v) 


Qb\a 

qy\a = TrB {gY\BQB\A) 


State-update (Quantum Instrument) 


oB\A 

Cy 

P{Y = y)p^ = f,^l^ (pa) 


Qyb\a 
pYB = TrA {qyb\aPa) 



TABLE II: Comparison of conventional and conditional state notation for CPT maps, POVMs, sets of states, quantum instru- 
ments and the Heisenberg picture. 



Theorem III.6. Let Eb\a ■ i^ ("Ha) ^ -2(^5) be a lin- 
ear map and let Qb\a G ^{T~Lab) be the Jamiolkowski- 
isomorphic operator, as defined in eq. (|29p . Then, Q b\a is 
a causal conditional state, i.e. it satisfies definition \III.5[ 
if and only if £b\a ** CPT. 

Proof. Theorem IIII.4I states that the operator pb\a that 
is Jamiolkowski isomorphic to a hnear map ^b\a is 
an acausal conditional state iff the hnear map £b\a = 
^B\A ° Ta is CPT. However, the Jamiolkowski isomor- 
phic operator to £b\a is Qb\a = Pb\a^ ^"^ *^^^ ^^ ^ causal 
conditional state iff £b\a is CPT. D 



Together with theorem IIIL41 this implies that the ac- 
tion of a CPT map £b\a on an operator Mac is given 

by 



£b\a (Mac) = Tr^ {qb\aM. 



iAC 



(31) 



where Qbia is the Jamiolkowski isomorphic operator to 

£b\a- 

Quantum dynamics may be represented by causal con- 
ditional states as follows. 

Proposition III. 7. Let gB\A be the Jamiolkowski iso- 
morphic causal conditional to a CPT map £b\a ihat de- 
scribes a quantum dynamics. If the initial state of region 
A is Pa, then the state of B, conventionally written as 



PB = £b\a {pa) , 



(32) 



can be expressed in the conditional states formalism as 
PB = TrA {qb\aPa) , (33) 

in analogy with the classical belief propagation rule 
eq. ^. 



We call eq. p3p the causal quantum belief propagation 
rule. It follows from eq. ((3T|) . 

Fig. [2] and the first line of table HIl summarize how this 
representation of quantum dynamics contrasts with the 
conventional representation, with fig. [5] emphasizing the 
analogy between the classical and quantum belief propa- 
gation rules. 



E. Causal Joint States 

JjIII Al showed that a joint state pab of two acausally 
related regions can be decomposed into a reduced state 
Pa and an acausal conditional state Pb\a- Similarly, two 
causally related regions can be described by an input 
state Pa and a causal conditional state QB\A^ but so far 
there is no causal analog of a joint state. This is ad- 
dressed by making the following definition, in analogy 
with eq. (ITT|) . 

Definition III. 8. A causal joint state of two causally 
related regions, A and i3, is an operator on T-Lab of the 
form 



QAB = Qb\a* PA, 



(34) 



where pA is a state on T-La and £1b|a is a causal conditional 
state of B given A. 

Note that the reduced state on A of qab is the initial 
state (input to the channel) and the reduced state on B 



Pb = Tr^ {gB\A*PA 



(35) 
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which, by the cychc property of the trace and proposi- 
tion IIII.7[ is the final state (output of the channel) . 

It is not too difficult to see that a causal joint state 
QAB is the partial transpose of an acausal joint state on 
Hab- Specifically, g% = Pbia^Pa"^ where Pb\a = B^^a 
is an acausal conditional state and py^ is a valid reduced 
state because the transpose preserves positivity. 

Thus, just as a causal conditional state for B given A 
is an operator on "Ha "Si "Hb that can be obtained as the 
partial transpose over A of an acausal conditional state 
Pb\Ai a causal joint state on AB is simply an operator on 
Ha ^ T~Lb that can be obtained as the partial transpose 
over A of an acausal joint state pab- 
Example III. 9 (Unitary Dynamics). Suppose a region 
A is assigned the state pA with eigendecomposition 



PA = T,tPj 



A is then mapped to a region 



B, which has a Hilbert space of the same dimension 
as that of A, by an isometry Ub\a = J2i l^j)s (""jIa- 
Since the Jamiolkowski isomorphism is basis indepen- 
dent, the causal conditional state associated with the 



map £b\a (•) = Ub\a (0 {Ub\a 
eigenbasis of pA as 



,t 



can be written in the 



Qbia 



\uj) (wfcU '^ Ub\a' \uk) (wjU- {Ub\a') (36) 



■J,k 



'^2^Wj){uk\A® \'"k){vj\- 

3,k 

It follows that the causal joint state for AB is 



(37) 



(38) 



QAB = 2^VPjPk\Uj) (Ufel^® \Vk){Vj\g. 

J.k 

Note that gB\A is the partial transpose over A of a pure 
acausal conditional state, as can be seen by comparison 
with eq. (IT51) from example lIII.31 Also, qab is the partial 
transpose over A of a pure acausal joint state pab = 
IV") (■0Iab' where 



'AB 



E 



/pJ|Uj)^(g) |u 



'3/B- 



(39) 



Causal joint states can be given an operational inter- 
pretation similar to that of acausal joint states by spec- 
ifying a procedure to perform tomography on them (see 
[3|, |j]). The motivation for introducing them here is that 
they allow quantum Bayesian inference to be developed 
in a way that is blind to the distinction between acausally 
and causally related regions. This is discussed in mV\ 

Note that whereas acausal joint states may involve 
more than two acausally related regions, causal joint 
states are thus far only well-defined for two causally re- 
lated regions. The reasons for this limitation are dis- 
cussed in PVIII 



F. Quantum-Classical Hybrid Regions 

An ensemble preparation procedure can be represented 
by a CPT map from a classical variable to a quan- 



tum region and a measurement can be represented as a 
CPT map from a quantum region to a classical variable. 
Therefore, these processes can be represented by condi- 
tional states for which either the conditioned or condi- 
tioning region is classical as a special case of proposi- 
tion nil. 71 In order to compare this to the conventional 
formalism, we need to describe how composite regions 
consisting of a classical variable and a quantum region 
are represented in the conditional states formalism. Such 
composites are called quantum-classical hybrid regions. 
As in example IIII.21 the classical variable X is asso- 
ciated with a Hilbert space Hx equipped with a pre- 
ferred basis {\x)} that represents the possible values of 
X. The quantum region is associated with a Hilbert 
space Ha and the joint region with the tensor product 
HxA = Hx "8) Ha- In order to preserve the classical 
nature of X, states and conditional states on HxA are 
restricted to be of the following form: 

Definition III. 10. A hybrid operator on HxA is an op- 
erator of the form 



MxA = Yl I 



Ix'^M^, 



(40) 



where {\x)} is a preferred basis for Hx and {M^} is a 
set of operators acting on Ha, labeled by the values of 
X . The operators M^ are referred to as the components 
oiMxA- 

When MxA is a state, this ensures that the reduced 
state on X is diagonal in the preferred basis and that 
there can be no entanglement between the quantum and 
classical regions. 

Although our primary interest is in causal hybrids, 
since these are relevant to preparations and measure- 
ments, one can also have acausal hybrids. As shown 
below, the set of acausal hybrid conditional states and 
the set of acausal hybrid joint states are invariant under 
partial transpose. It follows that the set of acausal hybrid 
states and the set of causal hybrid states are the same. In 
particular, this means that, unlike fully quantum causal 
states, hybrid causal states not only have positive partial 
transpose but are themselves positive, and unlike fully 
quantum acausal states, hybrid acausal states not only 
are positive but also have positive partial transpose. 

Therefore, for hybrid states, the notational distinction 
between p and g serves merely as a reminder of the causal 
arrangement of the regions under consideration. This in 
contrast with the fully quantum case, where the distinc- 
tion also has significance for the mathematical properties 
of the operator. When making claims about the math- 
ematical properties of hybrid conditional states that are 
independent of causal structure, the notation a is used. 
According to these conventions, any fornrula expressed 
in terms of ct's will yield a valid formula about hybrid 
states if the ct's are replaced by either p's or p's. 

For hybrid regions, there are two possible types of con- 
ditional state, depending on whether the conditioning is 
done on the quantum or the classical region. In the case 
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of conditioning on the classical variable, a hybrid condi- 
tional state (Jaix is a positive operator on JixA satisfying 



Tr^ (cta|x) = Ix- 



(41) 



In the case of conditioning on the quantum region, a 
hybrid conditional state ax\A is again a positive operator 
on HxA, but this time satisfying 



Trx {(Jx\a) = Ia- 



(42) 



G. Ensemble Averaging as Belief Propagation 

A hybrid conditional state of the form aA\x is a quan- 
tum state conditioned on a classical variable. The causal 
interpretation of such states is a process that takes a 
classical variable as input and outputs a quantum state. 
This is just an ensemble preparation procedure. In 
such a preparation procedure, a classical random vari- 
able X is generated (by flipping coins, rolling dice, or 
any other suitable method) with probability distribution 
P{X). Depending on the value x oi X thereby obtained, 
one of a set of quantum states {p^} is prepared for a 
quantum region A. If you do not know the value of 
X, then you should assign the ensemble average state 
Pa = J2r P {^ = ^)Px to ^- This scenario is depicted 



in fig. 3(a) A quantum preparation procedure has an 
obvious classical analogue wherein the quantum region 
A is replaced by a classical variable R that is prepared 
in one of a set of probability distributions P{R\X = x) 
depending on the value x oi X. If you do not know the 
value oi X, then the classical belief propagation rule spec- 
ifies that you should assign the probability distribution 
PiR) = fix P{R\X)P{X) to R. This case is illustrated 



in fig. 3(b) This section shows that a set of density op- 
erators can be represented by a hybrid conditional state 
of the form aA\x a-nd that the formula for the ensemble 
average state in a preparation procedure is a special case 
of quantum belief propagation. 

Theorem III. 11. Let aA\x be a hybrid operator, so that 
by eq. ^0^ it can be written as 



>A\x 



Y.p^® 



X {X 



X ' 



(43) 



for some set of operators {p^}- Then, <Ja\x satisfies 
the definition of both an acausal and a causal conditional 
state for A given X iff each of the components p^ is a 
normalized state on Ha ■ 

The proof is provided in appendix [X] 

It is often convenient to use the notation aA\x=x = 
{x\x aA\x \x)x ~ Px foi' the components of a conditional 
state of this form. 

If we adopt the convention that the partial transpose 
on the Hilbert space of a classical variable is performed 
in its preferred basis then it has no effect on hybrid oper- 
ators. Thus, acausal conditional states of the form pa\x 




pA = Trx (qaixPx) 
Conventional notation: 

PA = E. P{x = x)p^ 



(a) 




P{R)^j:^P{R\X)PiX) 



(b) 



FIG. 3: Quantum and classical preparation procedures, (a) A 



quantum preparation procedure is a process that takes a clas- 
sical variable X as input and outputs a quantum region A in 
one of a set {p^ } of states, depending on the value of X. It is 
mathematically equivalent to the special case of a CPT map 
where the input is classical, (b) A classical preparation proce- 



dure is a process that takes a variable X as input and outputs 
one of a set {P(R\X = a;)} of probability distributions over R, 
depending on the value of A". It is mathematically equivalent 
to the stochastic dynamics depicted in fig. |2(b)[ 



are invariant under partial transpose on X and this is 
why acausal and causal conditional states that are con- 
ditioned on the classical variable have the same form [73] ■ 
The remainder of this section concerns the causal inter- 
pretation of such states in terms of preparation proce- 
dures, so we shift to the notation qaix- 

Proposition III. 12. Let Qa\x be the causal hybrid con- 
ditional state with components given by a set of states 
{Px} ■ The ensemble average state arising from a prepa- 
ration procedure that samples a value x of a classical vari- 
able X from the distribution P{X) and prepares the state 

Px, 



" is given by 



PA 



E 



p{x - x)pi 



(44) 



This can be expressed in the conditional states formalism 
via the quantum belief propagation rule as 



PA = Trx {qa\xPx) , 
where px = J2x ^i^ = ^) 1^) 



(45) 



IX- 



This result follows simply from substituting the defini- 
tion of Px and gA\x into eq. (|45p . 

Fig. [3] and the second line of table |lT] summarize how 
the representation of a preparation procedure within the 
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conditional states formalism contrasts with the conven- 
tional representation and how the latter generalizes the 
analogous classical expression. 

It should be noted that theorem IIII.llI can alterna- 
tively be derived as a special case of theorem IIIL6I and 
proposition IIIL12I as a special case of proposition IIII.7I 
This follows from the fact that a preparation procedure 
can be represented by a CPT map £a\x from a classi- 
cal variable to a quantum region (sometimes called a CQ 
map). The map is defined on diagonal states px via 



£a\x{P- 



'X 



E' 



ixPxmxPx 



(46) 



By proposition IIII.7| the conditional state associated 
with the preparation is the Jamiolkowski isomorphic op- 
erator to this map. Eq. (|45)) is then obtained as a special 
case of eq. (|55)) where the input is classical. 



H. The Born Rule as Belief Propagation 

A hybrid conditional state of the form ayiA is a- clas- 
sical probability distribution conditioned on a quantum 
region. The causal interpretation of such states is as a 
process that takes a quantum region as input and out- 
puts a classical variable. This is just a measurement. 
The most general kind of measurement on a quantum re- 
gion A is conventionally represented by a POVM {E:^} 
with the classical variable Y ranging over the possible 
outcomes. If the state of the region is pA, then the prob- 
ability of obtaining outcome y is given by the Born rule 
as P{ Y — y ) = Tr^ (^E^pa)- This scenario is depicted 



in fig. 4(a) In the classical analogue, the quantum re- 
gion A is replaced by a classical variable R, the state 
PA is replaced by a distribution P{R) and the POVM 
is replaced by a (possibly noisy) classical measurement 
described by a set of response functions {P{Y ~ y\R)}, 
i.e. a set of functions of R labeled by the values of Y, 
where P{Y — y\R = r) specifies the probability of ob- 
taining the outcome y given that R — r. The over- 
all probability of obtaining the outcome y is given by 
P{Y = y) = J2rP{Y = y\R)P{R), which is just another 
instance of belief propagation. This case is illustrated in 
fig. |4(b)[ In analogy to this, the remainder of this section 
shows that the components of a conditional state (Jy\a 
form a POVM, and that the Born rule can be written 
as quantum belief propagation with respect to a causal 
conditional state of this form. 

Theorem III. 13. Let ay] a be a hybrid operator so that, 
by eq. (j40p . it can be written in the form 



CylA = 



y 



\y) {ylv ® E^ 



(47) 



for some set of operators {Ey}. Then, aY\A satisfies 
the definition of both an acausal and a causal conditional 
state for Y given A iff the set {E;^} is a POVM on Ha, 




pY = TrA {gY\APA) 

Conventional notation: 
P{Y = y) = TrA {E^Pa) 



(a) 




P{Y)^Y.bP{Y\R)P{R) 



(b) 



FIG. 4: Quantum and classical measurements, (a) A quan- 
tum measurement is a process that takes a quantum region 
A as input and outputs a classical variable Y . It is mathe- 
matically equivalent to the special case of a CPT map where 
the output is classical, (b) A classical (noisy) measurement 



is a process that takes a variable R as input and outputs a 
variable Y that depends on R, possibly in a coarse-grained or 
non-deterministic way. It is mathematically equivalent to the 
stochastic dynamics depicted in fig. 2(b) 



The proof is given in appendix \^ 

It is sometimes useful to use the notation (JY=y\A = 
(yly "'via \y)Y ~ ^v ^'^^ ^^"^ components of conditional 
states of this form. 

Unlike the case of hybrid states that are conditioned on 
a classical variable, conditional states of the form (7y\a 
are not invariant under partial transpose on the condi- 
tioning region. However, taking the partial transpose 
over A of Py\a yields another valid acausal conditional 

state, because [{E^f^'l is a POVM iff {E^} is. The 
remainder of this section concerns the causal interpreta- 
tion of such states, so the notation Qyia is adopted. 

Proposition III. 14. Consider a measurement of a 
POVM {Ey} on a quantum region A in state pA- Let 
gY\A be the causal conditional state with components Ey . 



The Born rule, 



P{Y = y)= TrA (E^Pa) 



(48) 



I.e. 



each Ey is positive and ^ Ey — / 



M- 



cxxn then be expressed in the conditional states formalism 
as the quantum belief propagation rule 

PY = TrA {qy\aPa) , (49) 

where pY = Y.yP{Y = y) \y) (yly 

This is easily verified by substituting the definition of 
Qy\a from eq. (gT]) into eq. dig]) . 
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This representation of a measurement as a causal con- 
ditional state and of the Born rule as an instance of belief 
propagation is summarized in fig. |4] and the third line of 
table ini 

Once again, these results can be understood as a spe- 
cial case of theorem IIIL6I and proposition IIII. 71 bv recog- 
nizing that a POVM may be represented as a map from a 
quantum region to a classical variable (sometimes called 
a QC map). Specifically, if the probability distribution 
P(Y) is represented by a diagonal state py, then the 
measurement can be represented by the CPT map £y\a 
defined by 



Py = Sy\a{pa) = X! ^^^ i^yPA) Iv) ivU 



(50) 



The causal conditional state Qyia appearing in eq. (j49p 
is simply the Jamiolkowski isomorphic operator to this 
map. 



I. Belief Propagation of Conditional States 

Consider three causally related regions A, B and C, 
such that B is in the future of A and C is in the future 
of B. If the dynamics is Markovian then it can be de- 
scribed by first applying a CPT map £b\a to A followed 
by a CPT map £c\b to B. This scenario is illustrated 
If we are only interested in regions A and C 



in fig. 5(a) 



then region B can be eliminated from the description by 
composing the two maps to obtain £c\A = £c\b ° £b\a, 
wherefo J-(.) = f (J'(-))- 

In the conditional states formalism the CPT maps are 
replaced by the Jamiolkowski isomorphic causal condi- 
tional states Qb\a^ Qc\b and Qc\A^ and thus £c\A = 
£c\B ° £b\a should be replaced by a formula for gc\A 
in terms of gB\A and gc\B- 

As an aid to intuition, consider the classical analog 
of this scenario as depicted in fig. |5(b)[ Here, the vari- 
able S is in the future of R, and T is in the future of S. 
The three variables are related by a Markovian dynam- 
ics, described by the conditional probability distributions 
PiS\R) and p'{T\S). An initial probability distribution 
P{R) can be propagated into the future to obtain P{T) 
in two steps. First we propagate from i? to S" to obtain 
PiS) = J2i^PiS\R)PiR), and then from 5 to T to ob- 
tain P{T) = ^5P(T|S')F(5). Combining these steps 
gives P{T) = Y^j^g P{T\S)P{S\R)P{R), so defining the 
conditional probability distribution 



p(r|i?) = ^p(T|^)p(5|i?), 



(51) 



allows the belief propagation from i? to T to be performed 
in a single step via P{T) = Efl,^(^l-R)-P(^)- 

The quantum analog of this is given by the following 
theorem. 

Theorem III. 15. Let £b\a< £c\b 'J'^^ £c\a be lin- 
ear maps such that £c\A — £c\b ° £b\a- Then, the 



C 



QciA = Trij (qc\bQb\. 



Conventional notation: 

Sc\A = £c\B ° £-B\A 



(a) 




P{T\R) = Es P{T\S)P{S\R) 



(b) 



FIG. 5; Propagating causal conditional states and conditional 
probability distributions. 



Quantum case, (b) Classical 



Jamiolkowski isomorphic operators, Qb\A! Qc\b o.'f^d Qc\a 
satisfy 

Bc\a = TrB (qcibSbia) ■ (52) 

Conversely, if three operators satisfy eq. (j52p . then the 
Jamiolkowski isomorphic maps satisfy £c\A = £c\B ° 
£b\a- 

The proof is provided in appendix [Xj 

Eq. (15^ can be regarded as a belief propagation rule 
for causal conditional states. It propagates beliefs about 
B, conditional on A, into the future to obtain beliefs 
about C, conditional on A. 

A similar formalism can be developed for the propa- 
gation of acausal conditional states across acausally re- 
lated regions. However, for present purposes, it is more 
interesting to consider a situation of mixed causality, 
wherein a causal conditional state is propagated across 
two acausally related regions. This is used in the appli- 
cation to steering developed in i]VA3l 

Consider the scenario depicted in fig. [6] Initially, a 
state pab is assigned to regions A and B, which are 
acausally related. A CPT map £c\b (alternatively repre- 
sented by a causal conditional state Qc\b) is then applied 
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to region B to obtain the state of AC. In this scenario, 
C is causally related to B, but acausally related to A. 
By theorem IIII.4| the state of AC is given by 

PAC = Trs {qcibPab) ■ (53) 

Now, pAB = Pb\a* PA and pAc = Pc\A * PA, so we have 

Pc|A*/OA ==Trs (£»c|sPb|a) *PA, (54) 

where we have used the fact that pA commutes with gc\B- 
Taking the •-product of this equation with p^ then gives 

Pc\A = Trs {qcibPbia) , (55) 

in analogy with eq. (I52p . This can again be viewed as a 



belief propagation rule for conditional states, but this 
time an acausal conditional state is being propagated 
through a causal conditional state. 





Pc\A = Trs {ec\BPB\. 



FIG. 6: Propagating an acausal conditional state through a 
causal conditional state. 



J. The Heisenberg Picture 



In §111 D| quantum evolution from an early region A 
to a late region B was described by a map from states 
on Ha to states on Hb- However, dynamics can alterna- 
tively be represented in terms of observables rather than 
states. This is simply the textbook distinction between 
the Schrodinger picture and the Heisenberg picture. In 
the Heisenberg picture, a temporal evolution is described 
by a map from the space of observables on the late region 
B to the space of observables on the early region A. The 
observables are usually represented by self-adjoint oper- 
ators and the dynamics by unitary operations, but this 
can be generalized to take account of generalized mea- 
surements and CPT dynamics. In this generalization, 
Heisenberg dynamics consists of a map from POVM el- 
ements (also known as effects) on B to POVM elements 
on A. In other words, effects are evolved backwards in 
time in the Heisenberg picture. 

In order to describe the Heisenberg picture for CPT 
maps, it is necessary to define the notion of a dual map. 

Definition III.16. The dual map (fsi^)^ : £(^b) ^ 
£ {Ha) of a linear map £b\a ■ ^ C^a) -^ ■C {He) is the 
unique map that satisfies 

Tr^ ({£B\Ay (Nb) Ma) - Tr^ {Nb£b\a (Ma)) (56) 



for ah Ma € £ CHa) , iVs G -C (Hb)- 

Note that the input space for the dual map is the out- 
put space for the original map and vice versa. The nota- 
tional convention: 



'-■A\B 



^ E 



B\A} 



(57) 



is adopted in order to make this clear. 

If £b\a is the map describing time evolution in the 
Schrodinger picture through the formula pB = Eb\a{pa), 
then the same evolution is described in the Heisenberg 
picture by the dual map £a\b through the formula E^ — 
^A\Bi^y) where {E^} and {E^} are POVMs. This fol- 
lows from the condition that the two pictures should be 
operationally equivalent, i.e. they should assign the same 
probabilities. To see this, imagine that the evolution 
from region A to -B is followed by a measurement on 
B yielding an outcome Y. This scenario is depicted in 
fig. [T] The probability of observing the effect E^ after 
a preparation of pA followed by an evolution £b\a is ex- 
pressed in the Schrodinger picture as Tr^ (^E^ pb) where 
Pb = ^BlAipA), while it is expressed in the Heisenberg 



picture as Tr^i (^EyPA) 



where Ey 



SlsiE^). Thedef- 



inition of the dual map ensures that the two expressions 
for the probability are equivalent, i.e. Tr^ {E^ pb) = 

Tya{E^Pa). 

A CPT map £b\a can always be written in a Kraus 
decomposition 



£B\A{-)=y^KA-)Ki 






(58) 



where K^, : %, 



Ui 



The dual map £a\b *^^^ then 



^ti ■ 'lA — ? ns- 
be obtained by taking the adjoint of the operators in the 
Kraus decomposition, i.e. 



'-'A\B 



(•) 



A^ 



kU-)k, 



(59) 



Thus, if "Ha and Hb are isomorphic and £b\a is a unitary 
operation, i.e. £b\a{') — U{-)U^ for some unitary opera- 
tor U, then SAlBi) ~ U^ {■)U . This is the familiar special 
case of the Heisenberg picture for unitary dynamics. 

In order to translate the Heisenberg picture into the 
conditional states formalism, first represent the POVM 
{Ey} by a conditional state Qy\B: the CPT map £b\a 
by a causal conditional state Qb\Ai and the POVM {Ey} 
by a conditional state Qy\a- Secondly, note that fig. [7] 
is just a special case of fig. 5(a) from i jHIIl in which the 



final region is classical, so the three conditional states are 
related by eq. ([5^ . i.e. 



(60) 



Qy\a — Trs {qy\bQb\a) 



In i jIIIII this was described as a belief propagation for- 
mula for causal conditional states because Qy\b was re- 
garded as defining a map from qb\a to Qy\Ai propagating 



15 



beliefs about B, conditional on A, into the future. How- 
ever, in the context of Heisenberg dynamics, we instead 
regard gB\A as defining a map from gY\B to gY\A, in the 
opposite direction to the flow of time. It remains to show 
that eq. (|60p is equivalent to the conventional description 
of Heisenberg dynamics in terms of dual maps. 



can be expressed in the conditional states formalism as 





Qy\a = Trs {qyibQbia) 



Conventional notation: 



E„ 



^A\B V^y 



FIG. 7: Dynamics in the Heisenberg picture. 

Theorem III. 17. Let Qb\a be the causal joint state that 
is Jamiolkowski isomorphic to the CPT map £b\a- Then, 

the action of the dual map £\\j^ on an operator Mbc is 
given by 



-A\B 



{E\.g®Ic) {Mbc) = TrB {MbcQb\. 



(61) 



Proof. By definition IIII.16[ the dual map to £b\a is the 
unique linear map £\\g that satisfies 

Tr^ (4|s {Nb) Ma) = Tr^ {Nb£b\a {Ma)) (62) 

for all operators Ma and Nb- Using the Jamiolkowski 
isomorphism, theorem IIII.6| the right hand side can be 
written as 

Trs {Nb£b\a {Ma)) = Tr^s {NbQb\aMa) (63) 

= 1:ta{^tb{NbQb\a)Ma). (64) 

The only way this can equal Tr^ i£*j^,g{NB) Ma] for 

all Ma is if £\^j^ {Nb) = Trs {NbQb\a)- Eq. dM]) then 
follows by linear extension to Hbc- D 

Combining this with eq. (|60p gives the following propo- 
sition. 

Proposition III. 18. Let Qbia be the causal conditional 
state associated with a quantum evolution described by 
the CPT map £b\a <md let gY\A o,'>^d Qy\b be the hy- 
brid conditional states associated with the POVMs {E^} 
and {E^}, such that {E^} is obtained from {E^} by the 
Heisenberg picture dynamics. The conventional descrip- 
tion of evolution in the Heisenberg picture, 



^v - ^A\B i^y 



(65) 



Qy\a = TrB {qyibObia) 



(66) 



This follows straightforwardly from theorem llll. 1 71 and 
theorem IIII.13I 

As in §11111 similar reasoning can be applied to other 
causal scenarios. Consider the special case of fig. IH] in 
which C is replaced by a classical variable Y. This is 
depicted in fig. [5] Two acausally related regions, A and 
B, are assigned a state pab and then the POVM {Ey] 
(alternatively represented by a conditional state Qyib) is 
measured on region B. This is the type of scenario that 
occurs in an EPR experiment. By measuring the region 
B, information is obtained about the remote region A 
and we are interested in how the state of A is correlated 
with the measurement outcome Y. 



Y \ pY\A = Trs {qy\bPb\a) 





Conventional notation: 



FIG. 8: Heisenberg evolution for a remote measurement. 

The belief propagation formula for this scenario, 
eq. (|55|) . gives 



Py\a — Trs {qy\bPb\a) 



(67) 



The components of the conditional state Py\a specify a 
POVM [E^]. This POVM can be thought of as describ- 
ing the effective measurement that gets performed on A 
when we actually measure region B. When combined 
with Pa, the conditional state Py\a specifies the ensem- 
ble of states for region A associated with the different 
measurement outcomes via pay — Py\a^ Pa- In terms of 
components, this is 



PYA = ^ \y) (yh 



'Pl^>l, 



(68) 



so the unnormalized state of A corresponding to the out- 
come y = y is 



P{Y = y)p^ = plE^p% 



(69) 



where P{Y = y) = Tr^ {E^Pa) - Tr^ {E^ Pb) is the 
Born rule probability for the measurement outcome Y = 

y- 

If Pb\a in eq. (|571) is thought of as specifying a map 
from gY\B to Py\a, then this map is analogous to a 
Heisenberg picture dynamics, except that the propaga- 
tion is across acausally related rather than causally re- 
lated regions. If ^b\a is the Jamiolkowski isomorphic 
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map to Pb\a then, by theorem IIII.171 the POVM ele- 
ments are related by 



Ei 



'A\B 



Et 



(70) 



Mathematically, the only difference between this and a 
Heisenberg picture map is that ^b\a ° Ta is completely 
positive rather than the map €b\a itself. This is just 
a reflection of the fact that we are propagating across 
acausally related, rather than causally related, regions. 

A similar expression to eq. (jTO]) has appeared in the 
context of quantum steering [33i], although there it is 
written in terms of the Choi, rather than Jamiolkowski, 
isomorphic map so there is a transpose in the expres- 
sion. We develop this application in W A3I To our 
knowledge, the connection to Heisenberg dynamics has 
not previously been recognized. 



K. Quantum Instruments as Causal Belief 
Propagation 

Describing a measurement by a POVM is adequate for 
determining the outcome probabilities of the measure- 
ment via the Born rule. However, one might also wish 
to describe how the state of a post-measurement region 
is correlated with the measurement result. In the con- 
ventional formalism, the transformative aspect of a mea- 
surement is represented by a quantum instrument. 

Definition III. 19. Given quantum regions A and B, 
and a classical variable Y, a quantum instrument is a set 
{Sy^"^} of CPT maps f^'^ : £ (Ha) ^ £ (His) such that 
the operators 



E^=£^^\^{Ib) 



(71) 



form a POVM. 



If A and B represent causally related regions before 
and after a measurement and Y represents the measure- 
ment outcome, then a quantum instrument can be used 
to determine the subnormalized state P{Y = y)py of B 
when the outcome is known via 



PiY ^ y)p^ ^ E^\^ {pa) 



(72) 



It can also be used to compute the outcome probabilities 
for the measurement by simply tracing over B in eq. (j72p 
to obtain 



(73) 



P{Y ^y)^ Trs (ff ^ {pa) 
Using eq. ((7T|) . this can be written as 

P{Y = y) = Trs (e^^^ (pa) 



Tib Ib£. 



?B\A 



-y {PA 
= TrA(£^^^^{lB)pA 

= Tr^ (E^Pa) , 



(74) 
(75) 
(76) 
(77) 



which is just the Born rule with respect to the POVM 
defined by the instrument. 

Whilst an instrument defines a unique POVM, each 
POVM corresponds to more than one quantum instru- 
ment. When performing a measurement of a particular 
POVM, any of the quantum instruments that correspond 
to it via eq. (j7ip may be obtained, depending on how 
the measurement is implemented. Conversely, the set 
of instruments corresponding to a given POVM exhaust 
the possible post-measurement transformations. This in- 
cludes, for example, the situation in which the system 
being measured is absorbed by the detector, which cor- 
responds to choosing the trivial Hilbert space for B, i.e. 

nB = c. 

Despite the freedom in choosing a quantum instru- 
ment, certain kinds of instrument are usually consid- 
ered particularly important. For measurements associ- 
ated with a projector- valued measure {H^}, the possi- 
ble quantum instruments include the Liiders-von Neu- 
mann projection postulate as a special case by taking 
T-Lb to have the same dimension as "Ha and £y {pa) = 
'^B\A (n^p^H^), where Tb\a is an isometry between T-La 

D I A 

and T-Lb- For general POVMs the rule By (pa) = 



Ei 



Ei 



which is sometimes taken as 



•-B\A \\^y ) PA y^y , 

a natural generalization of the projection postulate, is 
also included as a special case. 

A general measurement procedure, where there is an 
initial quantum region, a classical outcome of the mea- 
surement, and a quantum region after the measurement, 
is depicted in fig. 9(a) Note that the final quantum re- 



gion B may depend causally both on the initial quantum 
region A and on the outcome Y. 

In order to understand how quantum instruments are 
represented in the conditional states formalism, it is help- 
ful to first look at the classical analog. This is a scenario 
wherein a classical measurement is made upon a classi- 
cal system, which persists after the measurement, and in 
general the measurement procedure is permitted to dis- 
turb the state of the system. The variable R describes the 
system before the measurement and the variable S de- 
scribes the system after the measurement. This is in line 
with the quantum treatment, in which distinct regions 
are given distinct labels. The outcome of the measure- 
ment is again denoted by Y. This scenario is depicted in 
fig. |9(b)[ The measurement is then described by a con- 
ditional probability P{Y, S\R). This can equivalently be 
thought of as a set of subnormalized conditional prob- 
abilities for S given R, {P{Y — y,S\R)}, one for each 
outcome y, which is the analog of a quantum instrument. 
The joint distribution over Y and S, when the input dis- 
tribution is P{R), is then given by 



p(r,5) = ^p(r,5|i?)p(i?). 



(78) 



Furthermore, the set of response functions {P{Y — y\R)} 
associated with such a measurement is easily computed 
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from P{Y, S\R) by marginalizing over S, 

P{Y\R) = J2PiY,S\R). (79) 

s 

In the conditional states formalism, eqs. ((72|) and ((7T|) 
are replaced by straightforward analogs of eqs. ([75]) and 
([79|) for a causal hybrid state Qybia of a classical variable 
Y and quantum region B, conditioned on a quantum 
region A. 



PYB = TrA (£>yB|APA) 
QY\A = Trs (flysM) 



Conventional notation: 




P(F = y)p^ 



(a) 



tA|s 



(P/ 



{Ib 




P{Y,S) = Y.rP{Y,S\R)P{R) 
PiY\R) = Y.sP{Y,S\R) 



(b) 



FIG. 9: Measurements and their associated state-update 



rules, (a) A quantum instrument, representing how the state 
of a quantum persistent system changes after a measurement, 
(b) The classical analog of a quantum instrument, represent- 
ing how the state of a classical persistent system changes after 
a general (possibly disturbing) measurement. 



Theorem III. 20. Let Qyb\a be an operator on Hyab 
of the form 

QYB\A = X! ^y'^ ^2/|y «> QY=y,B\A, (80) 



where 

QY^yM\A ^ ff ^' E IJ') (^U ® \k) (jU' (81) 

are the Jamiolkowski isomorphic operators to maps 
£y : £,{Ha) -^ 2,{'Hb)- Then Qyb\A is a causal con- 
ditional state if and only if {£y } is a quantum instru- 
ment. 



The proof is similar to those of theorems IIII.6[ IIII.llI 
and nil. 1^ and is left to the reader. 



Theorem IIII.20I allows the transformative aspect of a 
quantum measurement to be represented by conditional 
states as follows. 

Proposition III. 21. LetQYB\A be the causal conditional 

state associated with the instrument {£y }. Then, 
when the measurement corresponding to this instrument 
is made with input state pA, the state-update rule in con- 
ventional notation is given by 



P{Y = v)p^ = 5f 1-4 [pa] . 



(82) 



In analogy with the classical expression in eg. (|78p . this 
can be expressed in the conditional states framework as 



PYB = TrA {qyb\aPa) 



(83) 



where pyb = X^u ^0^ — V) \v) {vW ® Py ■ Furthermore, 
the conventional expression for the relation between a 
POVM and a quantum instrument. 



can be expressed simply as 



A\B 



{Ib). 



Qy\a = Trs {qyb\a) , 
in analogy with the classical expression in eq. ((79 



(84) 



(85) 



The proof of eq. (I83p consists of applying proposi- 
tion nil. 71 in particular eq. (P5)) . to eq. (15^ for every 
value of Y. Eq. ((85)) follows from applying theorem lIII.171 
to each element of the instrument. 

Note that, from the perspective of the conditional 
states framework, the fact that there are many quantum 
instruments consistent with a given POVM is no more 
surprising than the fact that in classical probability the- 
ory there are many joint distributions consistent with a 
given marginal distribution. 

Finally, note that for a quantum instrument, the map 

^bia^Pa) — J2y^y (pa) is CPT and represents the 
non-selective state-update rule, i.e. the one that you 
should apply if you know that the measurement has been 
made but do not know its outcome. In the conditional 
states framework, if you know that a measurement asso- 
ciated with the causal conditional state Qybia has been 
performed, but you do not know the outcome, then you 
simply marginalize over Y to obtain the causal condi- 
tional state gB\A- Quantum belief propagation from A 
to B using gB\A is the non-selective state-update rule. 

Table [11] provides a summary of how dynamics, en- 
semble preparations and measurements are represented 
in the conditional states formalism as compared to the 
conventional formalism. 



IV. QUANTUM BAYES' THEOREM 

This section develops a quantum generalization of 
Bayes' theorem that relates the conditional states Pb\a 
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(qbia) and pa\b (qaib)- Formally, the quantum Bayes' 
theorem is the same for acausal and causal conditional 
states, so this represents a success in our project to de- 
velop a causally neutral theory of quantum Bayesian in- 
ference. In ijIV Al the quantum Bayes' theorem is intro- 
duced for two quantum regions. When written in terms 
of conventional notation, it reproduces the Barnum-Knill 
approximate error correction map [27||. ijIVBI special- 
izes to the hybrid case, which provides a rule for relating 
sets of states to POVMs. In conventional notation, this 
reproduces the definition of the "pretty-good" measure- 
ment [2J-[2g and a quantum analog of Bayes' theorem 
previously advocated by Fuchs [221]. As an application of 
the quantum Bayes' theorem, we develop a retrodictive 
formalism for quantum theory in piVCI in which states 
are evolved backwards in time. This demonstrates that 
the conditional states formalism is causally neutral with 
respect to the direction of time. Finally, in ijIVDl the 
acausal analog of the symmetry between prediction and 
retrodiction is discussed in the context of remote mea- 
surement. 



A. General Quantum Bayes' Theorem 

Recall that the classical Bayes' theorem is 
P{S\R)P{R) 



conditionals can be written as 



PiR\S) 



PiS) 



(86) 



which is derived by noting two expressions for the joint 
probability in terms of conditionals and marginals 



(87) 



P{R,S)=P{R\S)P{S) 
= P{S\R)P{R). 



Quantum conditional states can be used to derive a 
quantum analog of Bayes' theorem. For acausal condi- 
tional states, the two analogous expressions to eqs. ([57)1 
and IHHl are 



PAB ^ Pa\b*Pb 
= Pb\a* PA- 

Combining these gives 

Pa\b = Pb\a* {paPb^) 



(89) 
(90) 

(91) 



which is a quantum analog of Bayes' theorem for acausal 
conditional states. 

Classically, the distribution P{S) that appears in the 
denominator of Bayes' theorem is usually computed via 
belief propagation as P{S) = J2rP{S\R)P{R). This 
gives the alternative form of Bayes' theorem 



P{R\S) = 



P{S\R)P{R) 

J:rP{S\R)p{R)' 



(92) 



Similarly, noting that pB = T^a{pb\a*Pa) — 
TrA {pb\aPa), the quantum Bayes' theorem for acausal 



Pa|b = Ps|A*|pA [TrA (ps|aPa)] I 



(93) 



Now consider the case of two causally related regions. 
Suppose that a region A, described by the state pA, is 
mapped to _B by a CPT map £b\a that is Jamiolkowski- 
isomorphic to the causal conditional state gB\A- Re- 
call from ijIIIEl that the conditional state and input 
state can be used to define a causal joint state qab = 
Qb\a* PA- Now, we can try to define a new causal con- 
ditional state gA\B via an analogous decomposition of 
the causal joint state, namely, qab = Pa\b * Pb, where 
Pb = Tr^ {qb\aPa) is the output state of the channel. 
Equating the two expressions for the joint state qab, we 
obtain an expression for pa|Bj which can be regarded as 
the causal version of the quantum Bayes' theorem, 



Qa\b — Pbia* 



{paPb^) 



(94) 



and which can also be written as 

Qa\b = Qb\a * [pa [Tr^ {qbiaPa)] | ■ (95) 

In order for this to make sense, it must be checked 
that Qaib is indeed a valid causal conditional state. Tak- 
ing the partial transpose over B of eq. ((94)) . we have 

rji rji / rji - \ 

QjUB — Qb\a * [PAiPs') ]■ Given that pB is a valid 
state (positive and normalized), and given that the set 
of such states is mapped to itself by the transpose, p^ 
is also a valid state. Furthermore, given that Qb\a is a 
valid causal conditional state, and the fact that the set 
of such states are mapped to the set of valid acausal con- 
ditional states by the partial transpose, it follows that 
^B\A ^^ ^ valid acausal conditional state. But then, by 
the acausal quantum Bayes' theorem given in eq. (|9ip . 
^A\B ^^ * valid acausal conditional state, which implies 
that Qb\a is a valid causal conditional state. 

It is instructive to see how the causal and acausal ver- 
sions of Bayes' theorem appear in conventional notation. 

For the causal version, suppose that the causal con- 
ditional state Qb\a is associated, via the Jamiolkowski- 
isomorphism, with a quantum channel £b\a and its 
Bayesian inversion Qa\b is associated with a quantum 
channel J-a\b- Then, eq. ([M)) is equivalent to 



J". 



A\B 



(•)-pi{4|B[pZ(-)PB^]} 



Pa- 



where 



Pb = £b\a{.P/ 



(96) 



(97) 



The converse of this relation, wherein £b\a is expressed 
in terms of J^a\b, is obtained by simply exchanging A 
and B as well as £ and F. 

The proof that eq. (|94p translates into eq. (IMl) is 
straightforward. From the Jamiolkowski isomorphism 
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f theorem nil. 4[) . Ta\b{-) — Tr^ {qa\b{-))i which implies 
that 



^a\b{-) - Trs [pIpb^ Qb\aPb~' Pli-)) (98) 

^plTrsfpsM-Vs^tolAlpl, (99) 



where the first step follows from eq. (j94p and expanding 
the •-product, and the second step uses the cyclic prop- 
erty of the trace. Eq. ([TO]) then follows from the represen- 
tation of dual maps in terms of conditional states given 
in theorem nil. 171 

The map ^^415 is recognizable as the Barnum-Knill re- 
covery map for the channel £b\a |27[. This map is known 
to achieve near-optimal quantum error correction in situ- 
ations where the input state and channel are known. To 
our knowledge, its connection with Bayesian inversion 
has not previously been noted. It suggests that the best 
way of thinking about Ta\b is not as an error correction 
map, but rather as a means for accurately capturing your 
beliefs about region A given your beliefs about region B. 

A similar result holds for the acausal case. Suppose 
that the linear map that is Jamiolkowski-isomorphic to 
the acausal conditional state Pb\a is ^b\a a-nd the one 
associated to its Bayesian inversion Pa\b is ^a\b (recall 
that only ^b\a°Ta and ^a\b°Ta are CPT maps). Fol- 
lowing the same reasoning used above, eq. (pij) can be 
rewritten as 



^A\B{-)^pi{<^\B[p~Bh-)PB^]}pi 



where 



PB = '£b\a(pa)- 



(100) 



(101) 



B. Bayes' Theorem for Quantum-Classical Hybrids 

For quantum-classical hybrids, there are two versions 
of Bayes' theorem, depending on whether it is the condi- 
tioned region or the conditioning region that is classical. 
Recall that the mathematical form of hybrid conditional 
states does not depend on whether they are causal or 
acausal, and a is the notation used for results that do 
not depend on the causal interpretation. The two ver- 
sions of Bayes' theorem are then: 



and 



o'x\a = <^a\x * {pxPa ) 



0'A\X = <^X\A * {pAPx) 



(102) 



(103) 



where pA = Tr^ {'Ja\xPx) and px = Tr^ (ctxiaPa)- 

A hybrid joint state uxa may be decomposed into a 
hybrid conditional state and a reduced state via eq. pT|) 
in two distinct ways: either in terms of a classical reduced 
state and a conditional state that is conditioned on the 
classical part. 



CTXA = TAIX * Px, 



(104) 



or in terms of a quantum reduced state and a conditional 
state that is conditioned on the quantum part, 

crxA = crx\A* PA- (105) 



The hybrid Bayes' theorems of eqs. (|102p and (|103p give 
the rules for converting between these two decomposi- 
tions. 

To see how these two decompositions appear in conven- 
tional notation, recall from theorem IIII. 1 II that a general 
hybrid conditional state aA\x is of the form 



f^AlX 



E 



X) ix 



X®Px, 



(106) 



where each p'^ is a normalized density operator on "Ha, 
and from theorem IIII. 131 that <Jx\a is of the form 



fX|A 



El 



\x 



'Ei 



(107) 



where {E^} is a POVM on A. Finally, recall that 
the classical state px is of the form px — J2x ^(-^ ~ 



x)\x) 



IX 



where P{X) is a classical probability distri- 



bution. 

Eq. (|104p therefore gives a decomposition of a joint 
state in terms of a set of states and a probability distri- 
bution via 



OXA =Y,PiX = ^) \^) {Ax ® Pt 



(108) 



and eq. (|105p gives a decomposition in terms of a POVM 
and a state for A via 



axA = Y,\x){x\x®plE^pi 



(109) 



In terms of components, the Bayes' theorem of 
eq. (|102p is a rule for determining a POVM from a prob- 
ability distribution and a set of states, whilst eq. (|103p is 
a rule for determining a set of states from a POVM and 
a state on A. These rules are: 



and 



Tta (E^Pa) ' 



E^^P{X = x)pj,^P^Pa' 



(110) 



(111) 



where pA = Y.x ^i^ = ^)Px- 

These rules have appeared numerous times in the lit- 
erature, e.g. [3, 0, [231 ■ In the context of distinguish- 
ing the states in an ensemble, the POVM defined by 
eq. (Illll) is known as the "pretty good" measurement 
[24| - [26j . Eq. (|110p is a rule previously advocated by Fuchs 
as a quantum analog of Bayes' theorem [22] • The fact 
that these rules are all special cases of a more general 
quantum Bayes' theorem goes some way to explaining 
their utility. 
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C. Retrodiction and Time Symmetry 

As an application of the quantum Bayes' theorem, we 
use it to develop a retrodictive formalism for quantum 
theory, in which states are propagated backwards in time 
from late regions to early regions. This is operationally 
equivalent to the usual predictive formalism, in which 
states are propagated forward in time from early regions 
to late regions. The retrodictive description is particu- 
larly useful if you acquire new information about the late 
region and wish to update your beliefs about the early 
region, for instance, when you learn about the output of 
a noisy channel and wish to make inferences about its 
input. This situation will be considered in W Al 

Barnett et al. have previously proposed a formalism 
for retrodictive inference in quantum theory [28|-l30|. For 
unbiased sources — for which the ensemble average of 
the prepared states is the maximally mixed state — their 
formalism is identical to the one presented here, and the 
quantum Bayes' theorem provides it with an intuitive 
derivation. For biased sources, their formalism differs 
from ours. The one we propose has the advantage that it 
can be derived as a special case of our general formalism 
for quantum Bayesian inference and thereby retains a 
closer analogy with classical Bayesian inference. 

As emphasized by the de Finetti quote in the intro- 
duction, the rules for making classical probabilistic infer- 
ences about the past are the same as those for making 
inferences about the future. By analogy, in the condi- 
tional states formalism, we would expect to be able to 
propagate quantum states from future regions to past re- 
gions via the same rules used to propagate them from 
past regions to future regions. 

If the state pA of an early region is mapped to the state 
Pb of a later region by a CPT map £b\a then 



Pb = Tta {gB\APA) 



(112) 



where gB\A is the Jamiolkowski isomorphic causal condi- 
tional state to £b\a- By construction, the causal condi- 
tional state gA\B defined by Bayes' theorem in eq. ([M)) 
satisfies qab = Qb\a* Pa = Qa\b'*^ Pb, and this causal 
joint state has pA and pB as its marginals, so we have 

PA = Trs {qaibPb) (113) 

In conventional notation, this is equivalent to 

PA = £'a% (pb) , (114) 

where 

£a\b (•) ^ pi4|B {pb^ (■) Pb^) pi (115) 

is the map that is Jamiolkowski isomorphic to gA\B and 
the superscript '''^*'' is used to indicate that this is a retro- 
dictive map that propagates states from future to past 
regions. 





Predictive expression: 
QXY = TiA {gY\AQA\x) * Px 
Conventional notation: 
P.,y = Tta [E^p-^) P{X = x) 
Retrodictive expression: 
QXY = TrA {ex\AQA\Y) * PY 
Conventional notation: 



Px 



= TrA [E^ 



A.retr „A,rctr 



p^'-"-) P(Y = y) 



FIG. 10: A prepare-and-measure experiment in which a 
preparation procedure is followed by a measurement. We 
are interested in computing the joint probability distribu- 
tion P{X, Y) of the preparation variable and the measure- 
ment outcome. For compactness, px,y ~ P{X — x,Y = y) is 
used for the expressions in conventional notation. 



For comparison with the retrodictive formalism of |28[ , 
consider a simple prepare-and-measure experiment, as 
depicted in fig. [lOl 

In the predictive description, the preparation proce- 
dure is characterized by a probability distribution P{X), 
which can alternatively be represented by a diagonal 
state Px, and by a set of states p^, which can alterna- 
tively be represented by a causal conditional state gA\x- 
The measurement is characterized by a POVM {-E"^}, 
or alternatively a causal conditional state qyia- This is 
predictive because the conditional states, gA\x and gY\A, 
are conditioned on regions in their immediate past. 

In order to calculate the joint distribution P{X,Y), 
or equivalently the causal joint state gxY, we need to 
use the belief propagation formula for causal conditional 
states as given in eq. ([5^ . This proceeds as follows: 



1. Propagate the causal conditional state of A given X 
into the future to obtain gY\x — Tr^ {gY\AgA\x)- 

2. Combine the causal conditional state for Y given 
X with the state for X to obtain gxY = Py\x*Px- 

Combining these steps, the predictive expression for 
gxY is 



gxY = TrA {gY\AgA\x) * px- 



(116) 



In conventional notation, if the states {p^} are the 
components oi gA\x and the elements of the POVM {Ey} 
are the components oi gY\A, then eq. (|116p is equivalent 
to 



PiX^x,Y = y) 



TrA (E^P^) P{X 



x). (117) 



A retrodictive description of the same experiment can 
be given, involving states of regions conditioned on re- 
gions in their immediate future, i.e. pY, gA\Y and gx\A- 
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These correspond, respectively, to a probability distri- 
bution P{Y), a set of states {Py'""^'}, and a POVM 
{i?^'''°*''}. Note that, in contrast to the predictive de- 
scription, the measurement is now being described by 
a classical probability distribution and a set of states, 
which we call retrodictive states, whilst the preparation 
is being described by a POVM, which we call the retrodic- 
tive POVM. The retrodictive calculation of gxY proceeds 
as follows: 

1. Propagate the causal conditional state of A given 

Y into the past to obtain gx\Y = Tr^ {0x\aQa\y)- 

2. Combine the causal conditional state for X given 

Y with the state for Y to obtain gxv = Px\y * Py- 

Combining these steps, the retrodictive expression for 
gxY is 

QXY = ^rA{gx\A0A\Y)'*'PY- (118) 

In conventional notation, this is equivalent to 

PiX^x.Y^y)^ 

Tr^(i?,^^-*V^^-*OP(r = y). (119) 

The retrodictive and predictive descriptions of the ex- 
periment are related by the quantum Bayes' and quan- 
tum belief propagation via 



we have 



Qx\A = Qa\x* {pxPa^) 
where pA = Trj^ ipA\xPx) , 



and 



where 



Qa\y = Qy\a-^ {paPy^) 
Py = Tr^ {py\aPa) ■ 



(120) 
(121) 



(122) 
(123) 



In conventional notation, these equations are equiva- 
lent to 



where 



j^A,et. ^ p(^ ^ X)p-^^pip^' 



^x 1 



(124) 
(125) 



and 



where 



A.rctr ^ Pa^v Pa 
Py P{Y^yy 

P{Y = y) = Trs {E^Pa) 



(126) 
(127) 



Eqs. (|120p and (|122p can be used to prove that the 
predictive and retrodictive expressions for gxY do indeed 
give the same causal joint state. Starting from eq. (J118I) . 



QXY =TrA {qx\aQa\y) * Py 
=TT:A{{gA\x* [pxPa^]) 

{QY\A'^ [PAPY^])]i^ Py 

(11—1 1 — 

PYPxPA^QA\XPkPA 

PAPY^eY\APAPY''PY 

=TrA [PyPxPa'' QA\XPkPY'' eY\APA 



(128) 
(129) 
(130) 

(131) 
(132) 
(133) 



Since py commutes with px, PA and gA\x, the py term 
can be moved forward to cancel with py ^ term. The p^ ^ 
term can be made to cancel with the last p\ term via the 



cyclic property of the trace. This yields 
Tr^ [p\gA\xp\QY\A 



gxY 



= PiTr^ {qa\xQy\a) Px 
= Tr^ {qa\xQy\a) *Px, 



(134) 

(135) 
(136) 



where, in the second line, we have used the fact that 
Px commutes with Pyia- Finally, applying the cyclic 
property of the trace gives 



gxY = TrA {gY\AQA\x) * Px, 



(137) 



which is the predictive expression for gxY- 

We are now in a position to show that our formalism 
coincides with that of Ref. [2^ for the case of unbiased 
sources, where pA = lA/d with d the dimension oiJiA- In 
this case, the definitions of retrodictive states and retro- 
dictive POVMs in di were 



£ 



A,rctT 



dP{X = x)p: 



'x ) 



and 



A,rctr 
Py 



dP{Y = y) ■ 



(138) 



(139) 



which are easily seen to be special cases of eqs. (|124l) and 

(HMD. 

Finally, note that the analysis above can be extended 
to deal with the scenario depicted in fig. Ill[ in which 
there is an intervening channel £b\a between the prepa- 
ration and measurement so that the measurement is now 
made on region B. In fact, by making use of the rule 
for propagating conditional states given in eq. (|52|) (or 
equivalently the Heisenberg dynamics given in eq. (|60|)'). 
the extra region B can be eliminated from the description 
by defining 



Qy\a = Tr^ {qy\bQb\a) , 



(140) 



which determines the effective measurement on A that 
is performed by actually measuring region B. The three 
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operators px, Qa\x and Qy\a then provide a predictive 
description of a simple prepare- and- measure experiment, 
and so the previous analysis applies. 

Specifically, substituting eq. (|f40p into eq. (|ff6p gives 
the predictive expression for qxy as 

QXY ="^'cab{qy\bQb\aQa\x)'^ px, (141) 

or in conventional notation 

P{X = x,Y = y) = 

T^B{E^£B\A{pi))P{X^x). (142) 

Predictive expression: 

QXY = TrB (e^iBTrA (^BiAfiAix)) *px 

Conventional notation: 

p^,y = Tib (£f fsi.4 (p^)) P(X = a;) 



X 



Retrodictive expression: 

QXY = TiA (exi^Trs (eAiseiBiy)) * P>' 

Conventional notation: 

P.,y = TrA (i5i'^'-^^*^£:.X|B (pf "■^^O) nY = y) 



FIG. 11: A prepare-and-measure experiment with an inter- 
vening channel. We are interested in computing the joint 
probability distribution P{X, Y) of the preparation variable 
and the measurement outcome. For compactness, px,y = 
P{X = x,Y = y) is used for the expressions in conventional 
notation. 

Similarly, the retrodictive expression is obtained from 
eq (|118p by substituting qj^\y — Tr^ {qa\bQb\y), where 
qa\b and Qb\y are obtained from qb\a and Qy\b by 
Bayes' theorem. This gives 

Qxy = Tr^B {qx\aQa\bQb\y) * Py, (143) 

or in conventional notation 



P{X^x,Y^y) 
TvA [e^ 



^''■'='^^A|sK"'*"))^(>^ = y)- (144) 



To sum up, the formalism of quantum conditional 
states shows that, just as in the classical case, the rules 
of quantum Bayesian inference do not discriminate be- 
tween prediction and retrodiction. Specifically, the rules 
are the same regardless of whether the propagation is 
in the same or the opposite direction to the causal ar- 
rows. This reveals an important kind of time-symmetry 



that is not apparent in the normal quantum formalism. 
More importantly, it shows that a formalism for quantum 
Bayesian inference can be found that is blind to at least 
this aspect of the causal structure. 



D. Remote Measurements and Spatial Symmetry 

Many of the novel features of quantum theory exhibit 
themselves in the correlations that can be obtained be- 
tween local measurements on a pair of acausally related 
regions. These include Einstein-Podolsky-Rosen correla- 
tions and Bell correlations. Inferences about such mea- 
surements can be treated using a formalism that is al- 
most identical to the predictive and retrodictive expres- 
sions for prepare-and-measure experiments given above. 
One simply has to substitute the formula for propagat- 
ing a causal conditional state across acausally related 
regions, eq. ([55]). for the formula for propagating them 
across causally related regions, eq. (|52t . used above. 

For two acausally related regions, A and B, it is self 
evident that there is complete symmetry between prop- 
agation from one region to another and back again, i.e. 
if 



then 



Pb = Tr^ {pb\aPa) 



PA = Trs {pa\bPb] 



(145) 



(146) 



The two conditional states, Pbia and Pa\b are related by 
the quantum Bayes' theorem, so Bayes' theorem allows 
the direction of belief propagation to be reversed. 

To see this spatial symmetry at work, consider the sce- 
nario of measurements being implemented on a pair of 
acausally related regions, A and B, as depicted in fig. [12] 



Rightward belief propagation: 

Pxy — 

TrAs {qy\bQb\aQa\x) * px 
Leftward belief propagation: 
Pxy — 

Tr/is {gx\AQA\BQB\Y) * Py 





FIG. 12: Measurements on a pair of acausally related regions, 
rightward and leftward belief propagation. 

In the conditional states formalism, the region AB is 
assigned a joint state pab, and the measurements on A 
and B are represented by causal conditional states gx\A 
and gY\B- The joint distribution P{X,Y) over outcomes 
is given by the components of the acausal joint state pxY 
via 



Pxy = Tr^B {{gx\A9Y\B) Pab) 



(147) 
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These correlations can alternatively be calculated by 
propagating beliefs about A conditioned on X to beliefs 
about Y conditioned on X, and also by propagating be- 
liefs about B conditioned on Y to beliefs about X condi- 
tioned on Y. These representations are analogous to the 
predictive and retrodictive expressions for prepare-and- 
nieasure experiments discussed in the previous section. 
We shall refer to these as rightward and leftward belief 
propagation respectively. 

It may seem convoluted to calculate pxY via right- 
ward or leftward belief propagation, when eq. (|147|) al- 
ready gives a simple expression for it. However, it is 
important to understand how to propagate beliefs across 
acausally related regions in order to deal with the sit- 
uation in which you obtain new information about one 
region and wish to make inferences about the other. This 
is exactly what happens in the analysis of an EPR exper- 
iment. This problem is known as quantum steering and 
will be considered in W A3I 

First, consider rightward belief propagation. The aim 
is to rewrite eq. (|147p in terms of p^i^, the state of B 
conditioned on the region to its left. For greater sym- 
metry with the prepare-and-measure case, we also use 
px and qa\x to describe the left hand wing of the ex- 
periment, whilst retaining Pyib ioi the right. Then, we 
write pab = Pb\a* PA and note that, by Bayes' theorem. 



Qx\A = gA\x -* (pxpa^) ■ 

Substituting these into eq. (|147p gives 



(148) 



PXY 



= TrAi3 ( [gA\x * (pxPa^)] Qy\b [pb\a * Pa]) ■ (149) 
Expanding the •-products gives 



PXY 



Tr^s [PxPa'' QaixPxPa" 0y\bPaPb\aPa) ■ (150) 

All of the Pa terms can be canceled by using the fact 
that they commute with gY\B and px, and by using the 
cyclic property of the trace. Then, we have 



(151) 

(152) 
(153) 



PXY = TrAB yPxgA\XPx^Y\BPB\A 

= Ttab [qy\bPb\aPxSa\xPx 

= Tr^B {gY\BPB\AQA\x) *Px, 



where we have used the cyclic property of the trace 
and the fact that px commutes with Pb\a and Qy\b- 
Eq. (|153p has the same form as the predictive expres- 
sion for a prepare-and-measure experiment with an in- 
tervening channel given in eq. (J14ip . except that, in this 
case, Pb\a is acausal. We can also use eq. ([55]) to define 
Py\a — Tr^ {qy\bPb\a)i which represents the effective 
measurement on region A that is made by measuring re- 
gion B. Using this, region B can be eliminated from 



eq. (|153[) to obtain 

PXY = Tr^s {py\aQa\x) * Px- (154) 

This is similar to eq. (|116p . and, in fact, is mathemat- 
ically identical to it because Py\a is a hybrid condi- 
tional state, so its mathematical form does not depend on 
whether it is acausal or causal. Eq. (|154p is a useful form 
to use when we want to consider the effect of measuring 
B on the remote region A, as in quantum steering. 

For leftward belief propagation, a similar analysis gives 
the expressions 

PXY ^T^^AB {gx\APA\B0B\Y)'*' PY, (155) 

which is analogous to eq. (|143p and 



PXY = Tr^ (px\a9b\y) *Py, 



(156) 



which is analogous to eq. (jllSp . 

As with prediction and retrodiction, there is complete 
symmetry between leftward and rightward belief propa- 
gation, and there is a strong symmetry between causal 
and acausal belief propagation in general. This repre- 
sents progress towards a theory of quantum Bayesian in- 
ference that is completely independent of causal struc- 
ture. 



V. QUANTUM BAYESIAN CONDITIONING 

Classically, Bayesian conditioning is used to update 
probabilities when new data are acquired. Specifically, 
if you are interested in a variable R, and you learn that 
some correlated variable X takes the value x, then the 
theory of Bayesian inference recommends that you should 
update your probability distribution for R from the prior 
P[R) to the posterior P^{R) = P{R\X = x) ^. 

Bayesian conditioning can be viewed as a two-step pro- 
cess. First, the observation that X = x causes you to 
update your probability distribution for X from P{X) to 
Px{X), where 



Px{X 



5x.. 



(157) 



Secondly, assuming that the observation of X does 
not cause you to change your conditional probabilities 
P{R\X), the new probability distribution for R is ob- 
tained via belief propagation as 

P,(i?) = ^P(i?|A)P,(X) (158) 

X 

= P{R\X^x). (159) 

This two step decomposition of conditioning has been 
emphasized by Richard Jeffrey [3^, [3^] who showed that 
whenever an observation causes the value of a random 
variable to become certain, and that all probabilities con- 
ditioned on that random variable are unchanged by the 
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observation, then the change in the probabihty distribu- 
tion can be represented as Bayesian conditioning. 

The reason for emphasizing this decomposition is that 
Jeffrey was interested in situations in which an obser- 
vation does not cause you to beheve that some variable 
takes a precise value. As an example of this, adapted 
from [37|, suppose that X is the color of a jellybean, 
which has possible values 'red', 'green' and 'yellow', and 
that R is the flavor, which has possible values 'cherry', 
'strawberry', 'lime' and 'lemon'. Suppose that initially, 
your probability distribution for X, P{X), assigns a 
probability 1/3 to each color, and that your observation 
consists of viewing the jellybean in the light of a dim 
candle. This might not be enough for you to become cer- 
tain about the color of the jellybean, but it may reduce 
your uncertainty somewhat. For example, you may now 
think it reasonable to assign a probability distribution 
^post(-^) that gives probability 2/3 to X = 'red' and 1/6 
each to X = 'green' and X — 'yellow'. Assuming that 
the observation does not cause your conditional probabil- 
ities P{R\X) to change, Jeffrey shows that your posterior 
probability distribution for R is obtained by belief prop- 
agation via 



-Ppo.t(i?) = ^P(i?|X)Ppost(X), 



(160) 



which is known as Jeffrey conditioning. 

Many orthodox Bayesians reject the generalization to 
Jeffrey conditioning and maintain that the rational way 
to update probabilities in the light of data is always via 
Bayesian conditioning, not least because most of the ap- 
paratus of Bayesian statistics depends on this. This po- 
sition can be defended by insisting that the sort of sit- 
uations described above should really be handled by ex- 
panding the sample space to include statements about 
your perceptions. One can show that Jeffrey condition- 
ing can always be represented as Bayesian conditioning 
on a larger space in this way. Counter to this, Jeffrey 
argues that it is not realistic to construct such a space, 
since you do not actually have precise descriptions of your 
perceptions. This argument has been eloquently put by 
Diaconis and Zabell j38l] : 

For example, suppose we are about to 
hear one of two recordings of Shakespeare on 
the radio, to be read by either Olivier or Giel- 
gud, but are unsure of which, and have a prior 
with mass 1/2 on Olivier, 1/2 on Gielgud. Af- 
ter hearing the recording, one might judge it 
fairly likely, but by no means certain, to be 
by Olivier. The change in belief takes place 
by direct recognition of the voice; all the in- 
tegration of sensory stimuli has already taken 
place at a subconscious level. To demand a 
list of objective vocal features that we condi- 
tion on in order to affect the change would be 
a logician's parody of a complex psychological 
process. 



The debate over whether Jeffrey conditioning should 
be subsumed into Bayesian conditioning is somewhat 
analogous to a similar argument in quantum theory 
about whether POVMs should be regarded as fundamen- 
tal, since they can always be represented by projective 
measurements on a larger Hilbert space via Naimark ex- 
tension [39|. 

In the conditional states formalism, the quantum ana- 
log of Jeffrey conditioning is straightforward. If an ob- 
servation causes you to change the state you assign to a 
region A from pA to p^°^ , and if your conditional state 
for another region B, given A (which will either be an 
acausal state Pb\a or a causal state gB\A) is unchanged, 
then your posterior state for B is determined by belief 
propagation via either 



pT' = Tr. 



{pb\aP^a 



post\ 



post rr-i / pOSt\ 

Pb = Tr^ \Qb\aPa ) 



(161) 



(162) 



depending on whether A and B are acausally or causally 
related. 

The question of whether there is a quantum analog of 
Bayesian conditioning is more subtle, as it depends on 
whether there is a posterior quantum state for A that is 
analogous to having certainty about the value of a clas- 
sical variable, i.e. the point measure Px{X =^ x') = S^^x'- 
Arguably, a pure state could play this role, since it rep- 
resents the smallest amount of uncertainty that one can 
have about a quantum region. However, unlike classi- 
cal point measures, pure states still assign probabilities 
other than and 1 to fine-grained measurements, e.g. 
measurements in a complimentary basis, and there are 
good reasons to believe that, even if they represent max- 
imal knowledge, that knowledge is still incomplete [40]. 

We will not pursue this question further here, but in- 
stead focus on the case of a hybrid region XA. If the 
the data is the classical variable (so that one can indeed 
learn its value) then Bayesian conditioning has a straight- 
forward generalization. Upon learning that X = x, the 
state of A should be updated via 



PA -> CrA\x=x, 



(163) 



Recall that the general form of a state conditioned on 
a classical variable is <ta\x — J2x Px ^ 1^) {^\x^ where 
{Px} is a set of normalized density operators and that 
fA|x=2; is simply our notation for p^. The elements of 
this set are called the components oi a^x: so we may 
describe Bayesian conditioning as replacing aA with one 
of the components of aA\x- 

Note that, as in the classical case, conditioning can be 
viewed as a two-step process, wherein first the state of X 
is updated to the diagonal density operator for the point 
measure, p^ = \x) {x\^, and then belief propagation is 
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used to determine the posterior state of A, as follows 



^ 


= Trx (cTAlxPx) ' 


(164) 




= {x\x'^A\x \x)x, 


(165) 




— 0'A\X=xj 


(166) 



where we've made use of the fact that aA\x — 
^x'^A\x=x <8) \x) {x\. Note that this holds regardless of 
the causal relation between A and X because a hybrid 
conditional such as (Ja\x does not distinguish between 
these causal possibilities. 

Recall that the rule for propagating unconditional be- 
liefs about X to beliefs about A is pa = Trx {<Ja\xPx)- 
In conventional notation, this translates to pA = 
^jf P(X = x)aA\x=x- Therefore, Bayesian condition- 
ing is a process by which a state pA is updated to an 
element (Ta\x=x within a convex decomposition of pA- 



A. Examples of Quantum Bayesian Conditioning 

In this section, we consider several examples of 
Bayesian conditioning, based on the different causal sce- 
narios discussed in ijIIII and ijIVI In all these cases, con- 
ditioning the state of a quantum region on a classical 
variable is the correct thing to do in order to update the 
predictions or retrodictions that can be made about other 
classical variables correlated with the region. In partic- 
ular, in W A3[ we develop the application to quantum 
steering, showing that the set of states of a region that 
can be steered to by making remote measurements can be 
expressed compactly in terms of conditioning and belief 
propagation. 



1. Conditioning on a Preparation Variable 

C onsid er again the preparation scenario depicted in 
fig. 3(a) wherein a quantum region A is prepared in 
one of a set of states {gA\x=x} depending on the value 



of a classical variable X with prior probability distri- 
bution P{X). This can alternatively be described by 
a conditional state gA\x and a diagonal state px- In 
this case, Bayesian conditioning of A on the value x of 
X corresponds to updating from the ensemble average 
state PA — J2x^i-^ ~ x)gA\x=x to the particular state 
0A\x=x corresponding to the value x oi X that actually 
obtains, which is clearly a reasonable thing to do. 

The operational significance of this conditioning be- 
comes apparent by considering a measurement made on 
region A, described by a conditional state gyiA- We now 
have a prepare-and-measure experiment, as depicted in 
fig. [To] and eq. (J116I) gives the predictive expression for 
the joint probability distribution for this experiment as 



From this, we can compute the marginal probability for 
Fas 

PY = Tr^ {gY\APA) , (168) 

where pA = Trx (gAlxPx) is the ensemble average state. 
The conditional gY\x — gxY * Px is given by 

Py\x = Tr^ {gY\AgA\x) , (169) 

from which it follows that 

Py\x=x = TrA {gY\AgA\x=x) ■ (170) 

The transition from eq. (I168p to eq. (|170p is just classi- 
cal Bayesian conditioning of the probability for Y on the 
value of X . Both expressions are representations of the 
Born rule in terms of belief propagation, and eq. (jl70p 
can be obtained from eq. (jl68p by replacing pA with 
gA\x=xj which is just quantum Bayesian conditioning. 
Thus, quantum Bayesian conditioning on a preparation 
variable can be used as an intermediate step in updating 
the probability distribution for a measurement outcome 
by classical Bayesian conditioning. 

Nothing changes if we consider the slightly more com- 
plicated scenario where there is an intermediate chan- 
nel between the preparation and measurement, as de- 
picted in fig. [TT] This is because, as shown in i jIVCl the 
joint probability is still given by eq. (jl67p . where now 
gyiA = Trs {gY\BgB\A) describes the effective measure- 
ment on A that corresponds to the actual measurement 
made on the later region B. By similar reasoning, there 
could be an arbitrary number of time-steps between the 
preparation and measurement and conditioning the state 
of A on the preparation variable would still be the correct 
way to update the Born rule probabilities for Y. 

2. Conditioning on the Outcome of a Measurement 



Recall the measurement scenario depicted in fig. 4(a) 



A measurement with outcomes labeled by the classical 
variable Y is implemented upon a quantum region A. In 
the predictive formalism, this experiment is described by 
an input state pA and a causal conditional state gY\A 
(describing the measured POVM). To condition A on a 
value y of Y, Bayes' theorem must be applied to deter- 
mine gA\Yi and then the component gA\Y=y gets picked 
out by conditioning. This gives 

* (pAPYly) , (171) 



PA — ^ gA\Y=y — gY=y\A 

or in conventional notation 



PA -^ p. 



A.rctr 



TvA (E^Pa) ' 



(172) 



gxY 



Tr^ {gY\AgA\x) * Px- 



(167) 



The state gA\Y=y represents the state of a region A prior 
to the measurement, given the outcome of the measure- 
ment, i.e. it is a retrodictive state. Its operational sig- 
nificance is that it allows one to make inferences about 
variables involved in the preparation of A. 
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To see this, consider again the prepare-and-measure 
experiment of fig. [TU] Recall from ijIVCI that there is 
complete symmetry between the predictive and retrod- 
ictive expressions for a prepare-and-measure experiment 
under exchange of the preparation variable X with the 
measurement variable Y. Thus, everything that was said 
regarding the probability distribution for the measure- 
ment outcome in the previous example, applies here to 
the probability distribution for the preparation variable. 
In particular, the marginal probability distribution for X 
is 



px = Tr^ (qxiaPa) , 
and conditioning this onY = y gives 

PX\Y=y = Tr^ {gx\AQA\Y= 



(173) 



(174) 



Both of these expressions are belief propagation represen- 
tations of the Born rule, with respect to the retrodictive 
POVM for X. Thus, conditioning A on the outcome of 
the measurement can be used as an intermediate step 
in updating the probability distribution for the prepara- 
tion variable in light of the measurement outcome. The 
retrodictive state appearing in the retrodictive Born rule 
simply gets updated by Bayesian conditioning. 

As in the previous example, nothing changes if there 
are intermediate channels between the preparation and 
measurement. We can simply use conditional belief prop- 
agation to eliminate the additional regions. 



3. Conditioning on the Outcome of a Remote 
Measurement: Quantum Steering 

Finally, consider the case of a measurement made on 
a region B that is acausally related to a region A, as 
depicted in fig. El This experiment is described by an 
acausal joint state pab, and a conditional state gym, 
corresponding to the POVM measured on B. We are 
interested in how the state of the remote region A is up- 
dated when we learn the outcome of the measurement 
made on B. The updated state of A could then be used 
to predict the outcome of a measurement made on A, 
corresponding to a causal conditional state gx\A- This 
scenario is depicted in fig.[T21 The results of fJIVD] estab- 
lish that the mathematical description of this experiment 
is formally equivalent to a prepare-and-measure exper- 
iment with an intervening channel, the only difference 
being that the causal conditional state gB\A is replaced 
by an acausal conditional state Pb\a- This symmetry 
is enough to establish that, as in the previous example, 
conditioning the state oi A oiiY — y must be the correct 
way of updating the Born rule probabilities for X. 

However, it is worth developing this example in a lit- 
tle more detail, since the "remote collapse" at A that 
occurs upon measuring B is at the core of the EPR argu- 
ment. This has led to a study of the ensembles of states 
for A that can be obtained by measuring B, a problem 



known as quantum steering [231 . I33l . |41|-|46j . The condi- 
tional states formalism provides an elegant approach to 
this problem. 

Recall from i ^IVDl that the joint probability for X 
and Y can be computed via leftward belief propagation, 
which yields cq. (|156l) . i.e. 



PXY = Tr^ (gxiAPAlv) '*' PY ■ 



(175) 



This formula is obtained by first applying Bayesian in- 
version to gviB to determine 



gB\Y = gY\B*{PBPY^) 1 



(176) 



where py = Tr^ {gY\BPB) is the Born rule probabil- 
ity distribution for the measurement outcome. Then, 
conditional belief propagation is used to obtain paiy — 

Trs {p A\Bg B\Y)- 

Eq. (|175l) is formally equivalent to the retrodictive ex- 
pression for a prepare-and-measure experiment, so the 
rationale for conditioning is exactly the same as in the 
previous example. Specifically, the marginal probability 
distribution for X is 

px = TrA {gx\APA) , (177) 

and conditioning on Y — y gives 

PX\Y=y = TrA {gx\APA\Y=y) ■ (178) 

Thus, conditioning the state of A on y = y is an ap- 
propriate intermediate step for updating the Born rule 
probabilities for X. 

The following proposition summarizes this result and 
translates it into conventional notation. 

Proposition V.l. Let pab be the joint state of two 
acausally related regions. Suppose that the POVM corre- 
sponding to the conditional state gY\B is measured on B 
and the outcome Y = y is obtained. Then, the state of re- 
gion A should be updated from the prior pA = Trs (pab) 
io PA\Y=yj where 



PA\Y = Ttb {pA\BgB\Y) 

Pa\b = Pab -^ Pb^ , and 

gB\Y = gY\B*iPBPY^)- 



(179) 



(180) 



Let the components of the conditional state gY\B be the 
elements of the POVM {Ey\ and let ^a\b be the map 
that is Jamiolkowski-isomorphic to Pa\b- Then, the up- 
dated state of A is 



Pt 



-A\B 



iPy), 



where 



Py 



^1/2 pB ^1/2 
B _ Pb ^y Pb 



Ttb {EBpB) ' 



(181) 



(182) 
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In the above analysis, the method used to compute 
PA\Y=y is to first apply Bayes' theorem to Qy\b and then 
apply conditional belief propagation. This is the acausal 
analog of performing the calculation in the retrodictive 
formalism. However, we could equally well apply belief 
propagation first to obtain 



Py\a = Trs {qy\bPb\a) 



(183) 



and then apply Bayes' theorem to obtain 

PA\Y=y = PY=y\A * {pAPy^) ■ (184) 

This is the acausal analog of performing the calculation 
in the Heisenberg picture, and it is straightforward to 
check that it gives the same result. 

In conventional notation, this amounts to first deter- 
mining the effective POVM on A that is performed when 
B is measured via 






and then determining the updated state via 



Pt 



Pa 



e^pI 



Trs (E^pa) 



(185) 



(186) 



Expressions equivalent to these conventional formulas 
have appeared previously in [33| , the only difference be- 
ing the appearance of a transpose, due to the use of the 
Choi isomorphism rather than the Jamiolkowski isomor- 
phism in [33] ■ 

We have seen that the remote collapse pA — > PA\Y=y is 
an instance of quantum Bayesian conditioning. Within 
interpretations of quantum theory wherein quantum 
states are considered to describe reality completely, the 
change of the state of ^4 as a result of a distant measure- 
ment upon B is sometimes considered to be an instance 
of action at a distance. Indeed, Einstein criticized the 
Copenhagen interpretation on exactly these grounds. On 
the other hand, the analysis above shows clearly that if 
one views quantum theory as a theory of Bayesian infer- 
ence, then upon learning the outcome of a measurement 
on i?, all that occurs is that one's beliefs about A are up- 
dated. No change to the physical state of A is required 
within such an approach. This interpretation is bolstered 
by the formal equivalence to the case of conditioning a re- 
gion on the outcome of a subsequent measurement, which 
does not seem to imply retrocausal influences (although 
for realist interpretations it has been argued that a im- 
posing a particular symmetry principle does imply retro- 
causality in this scenario [a, Q). 

Einstein anticipated such an epistemic interpretation 
of remote collapse in his writing, as is argued in 47i]. 
However, he most likely thought that the probabilities 
represented by quantum states could be probabilities for 
the values of physical variables (possibly hidden), and 
that these could satisfy classical probability theory. How- 
ever, by virtue of Bell's theorem 4S], such an interpreta- 
tion of remote collapse is not possible within the standard 



framework for hidden variable theories. We evade this 
no-go result here by understanding remote collapse as 
Bayesian updating within a noncommutative probability 
theory rather than within classical probability theory. In 
itself, this does not provide a viable realist interpretation 
of quantum theory, but it does suggest that an accept- 
able ontology for quantum theory ought not to include 
the quantum state. Finally, note that our interpretation 
of remote steering is broadly in harmony with that of 
Fuchs j22| . However, the view of quantum theory pre- 
sented here differs from that of Fuchs in that he views 
quantum theory as a restriction upon classical probabil- 
ity theory whereas we consider it to be a generalization 
thereof. 



B. Why the Projection Postulate is not an 
Instance of Bayesian Conditioning 

Finally in this section, we deal with the elephant in the 
room — the projection postulate. Some authors have ar- 
gued that the projection postulate is a quantum analog 
of Bayesian conditioning (see e.g. [3l|, [S^])- However, 
the projection postulate is not an instance of quantum 
Bayesian conditioning as defined above. In this section, 
we discuss the relationship between quantum Bayesian 
conditioning and the projection postulate (and quantum 
instruments in general) at some length, in order to dispel 
the misconception that projection is a kind of condition- 
ing. After pointing out the formal differences between 
the two, we explain the different types of update rule 
that are associated with a measurement in both classi- 
cal probability theory and quantum theory, pointing out 
where conditioning and projection fit into this picture. 
Then, we explain how, in the conditional states formal- 
ism, the projection postulate should be thought of as a 
composite operation, consisting of belief propagation to 
a later region followed by Bayesian conditioning. This is 
broadly in line with the treatment of quantum measure- 
ments advocated by Ozawa [4^,|50|- Finally, we deal with 
the possible objection that, although the classical analog 
of the projection postulate is not just Bayesian condition- 
ing, it can be thought of as conditioning combined with a 
simple relabeling of the system variable. This argument 
does not apply in quantum theory because, unlike in a 
classical theory, any informative measurement necessar- 
ily disturbs the system being measured. Although this 
is well known, we present a formulation of information- 
disturbance in terms of conditional states, which makes 
it clear that no quantum instrument can be thought of as 
conditioning combined with relabeling. This may be of 
interest in its own right, as it emphasizes the similarity 
between information-disturbance and other trade-offs in 
quantum theory, such as the monogamy of entanglement. 

In the conventional formalism, when a projective mea- 
surement {H^} is made on a system A, the Liiders-von 
Neumann projection postulate says that, upon learning 
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the outcome y, the initial state pA should be updated via 



PA 






(187) 



For a general POVM {Ey}, there is a natural general- 
ization of the projection postulate, given by 



PA 



{E^)ipAiE^)i 
Tta {E^Pa) 



(188) 



This generalization has also been proposed as a quantum 
analog of Bayesian conditioning |5lll53|. 

On the other hand, in §V A 21 the rule for conditioning 
a state on the outcome of a measurement was found to 
be 



PA 



TvA [EApj,) 



(189) 



This is distinct from eq. (|188p because the roles of pA 
and Ey have been interchanged. Furthermore, eq. (|188p 
is not equivalent to an equation of the form of eq. (|189p 
(even allowing a different POVM to appear therein) be- 
cause the map pA — >■ (^^) ^ PA (Ey) ^ , like all quantum 
instruments, is linear on the set of states on Ha, whereas 

- A - ■ 

the map p — > p\Eyp\ is nonlinear on the state space. 

If you are inclined to view the projection postulate 
as the correct quantum generalization of Bayesian con- 
ditioning, then you could take this as evidence against 
the idea that the conditional states formalism provides 
an adequate theory of quantum Bayesian inference. We 
shall therefore go to some length to defend the claim that 
neither the projection postulate nor any other quantum 
instrument is an analog of classical Bayesian condition- 
ing. 

A quan- 



Consider the scenario depicted in fig. 13(a) 



tum system located in region A and described by the 
state Pa is subjected to a measurement with outcomes 
labeled by Y , and the system persists after the measure- 
ment. At this later time, it is represented by a region A' 
which, as always in the present formalism, we distinguish 
from region A, but which is associated with a Hilbert 
space of the same dimension. As discussed in ijIIIKl a 
quantum instrument, {£y }, determines how the state 

of A' is related to the state of A, where £y [pa] is the 
unnormalized state of A' obtained when Y = y. 

Now consider the classical analogue of this scenario, 
depicted in fig. |13(b)[ A classical system described by 
the variable i?, and assigned a distribution P{R), is sub- 
jected to a (possibly noisy) measurement, resulting in the 
outcome y, which is a random variable that depends on 
R through a conditional probability distribution PiY\R). 
The system persists after the measurement, where it is 
described by a random variable R' . The value of R' is 
presumed to depend probabilistically on R, and the na- 
ture of this dependence may vary with the outcome Y . 




Upon learning Y = y 
QBCl: PA -^ Sa\y=v 

QBC2: PA' -^ QA'\Y=y 
QU: PA -> QA'\Y=y 

(a) 




Upon learning Y = y 

BCl: P{R) -^ P{R\Y = y) 
BC2: p\b!) -^ P{R'\Y = y) 
U; P{R) -> P{R'\Y = y) 

(b) 



FIG. 13: Causal diagrams for the state update rules associ- 
ated with quantum and classical measurements, (a) A quan- 
tum instrument, representing how the state of a quantum per- 
sistent system changes after a measurement, (b) The classical 



analog of a quantum instrument, representing how the state 
of a classical persistent system changes after a measurement. 



This is captured by a conditional probability P{Y, R'\R), 
which is the classical analog of a quantum instrument. 

It is useful to distinguish three kinds of update rules 
that might be considered in this scenario, as defined in 
fig. |13(b)] To describe the difi^erence between these rules, 
it is useful to introduce some terminology. A distribu- 
tion over R or R' is said to be a prior distribution if it is 
not conditioned on the value of Y, and it is a posterior 
distribution if it is conditioned on the value of Y. The 
temporal ordering that is implicit in this prior/posterior 
terminology specifies whether the distribution character- 
izes the knowledge you have before learning the value of 
Y or the knowledge you have after learning the value of 
Y. In other words, it refers to the time in your episte- 
mological history, relative to the event of learning Y. On 
the other hand, the system's configuration at the time 
before the occurrence of the measurement will be called 
its initial configuration and its configuration at the time 
after the occurrence of the measurement its final config- 
uration. R is the initial configuration and i?' the final. 
The temporal ordering implicit in this latter distinction 
refers to the time in the system's ontological history. 

Strictly speaking, Bayesian conditioning is a rule that 
updates what one knows about one and the same variable 
upon the acquisition of new information. In other words, 
it maps a prior distribution about some variable to a pos- 
terior distribution for that same variable. Consequently, 
rule BCl, which maps the prior distribution of the initial 
configuration of the system to the posterior distribution 
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of the initial configuration of the system, is an instance 
of Bayesian conditioning (it is analogous to updating a 
retrodictive state as discussed in W A2|l . Rule BC2 is 
also an instance of Bayesian conditioning: it maps the 
prior distribution of the final configuration of the system 
to the posterior distribution of the final configuration of 
the system. However, the rule U is not an instance of 
Bayesian conditioning because it maps the prior distri- 
bution of the initial configuration of the system to the 
posterior distribution of the final configuration. In other 
words, if one considers R and R' to be distinct variables, 
then any map from a distribution over one of them to 
a distribution over the other cannot be an instance of 
Bayesian conditioning. 

We now return to the quantum scenario, this time us- 
ing the conditional states formalism and paying attention 
to the analogy with the classical case. The measurement 
is associated with the causal conditional state QyA'\a 

that corresponds to the quantum instrument \£y \ ■ 

In the quantum conditional states formalism, there are 
analogues of each of the three rules we considered above. 
These are specified in fig. 13(a) The rule QBCl corre- 



sponds to updating a retrodictive state, as considered in 
ijVA2l The von Neumann-Liiders projection postulate 
is clearly an instance of rule QU. If one stipulates that 
quantum Bayesian conditioning is a rule that updates the 
quantum description of one and the same region upon ac- 
quiring new information, i.e. that it maps a prior state 
for a region to a posterior state for the same region, then 
QBCl and QBC2 are instances of quantum Bayesian 
conditioning, but QU is not. 



Y,j^P[Y,R'\R)P{R). Setting Y = y gives the Bayesian 
conditioning step as 



P{R') 



P{R'\Y = y) = 



y,i?'|i?)P(i?) 



P(Y = y) 



(191) 



The quantum analogue of this is straightforward. 
Quantum state update rules, such as the projection pos- 
tulate, are of the form QU: pA — > QA'\Y=y This is 
simply a sequential combination of quantum belief prop- 
agation 



PA -^ PA' = TrA (qa'IaPa) , 



(192) 



with quantum Bayesian conditioning via QBC2: pA' — > 

eA'\Y=y 

Again, it is useful to express each of these steps in 
terms of the quantities that are given in the problem: 
the causal conditional state Pya'IA ^-nd the prior pA- The 
causal conditional state qa'IA used in eq. (|192p is simply 

a reduced state of Qya'IAi i-6- Qa'IA = Try [Qya'\a)- To 
gain some intuition for this step, we translate it into con- 
ventional notation. If the quantum instrument associated 

with Qya'IA is denoted {£y ' }, then eq. (|192p becomes 



PA -^ PA' =£a'\a{p 



where 



E 



A'\A 



= E<'"- 



(193) 



(194) 



1. State-Update Rules as a Combination of Belief 
Propagation and Bayesian Conditioning 

If QU is not an instance of Bayesian conditioning, then 
what is its status within our framework? We now show 
that it is a composite of two operations: belief propaga- 
tion followed by Bayesian conditioning. 

First consider the classical analog. The analog of 
the projection postulate (or any state update rule aris- 
ing from an instrument) is given by rule U: P{R) —>■ 
P{R'\Y — y). This can be obtained by combining an 
instance of belief propagation, namely 



P{R) -^ P{R') = ^P(i?'|i?)P(i?), 



(190) 



The map £a'\a is the non-selective update map. It is the 
appropriate map to apply when you know that the mea- 
surement has been performed, but you do not know which 
outcome occurred. The standard update map, which is 
appropriate when one also knows the outcome, is the se- 
lective update map. The projection postulate and its 
generalization to POVMs are instances of selective up- 
date maps. Applying the non-selective update map is 
just an instance of quantum belief propagation. 

Turning to the Bayesian conditioning step, we have 
0A'\Y = PA'Y^Py^, where pa'y = Tr^ {gYA'\APA)- Com- 
bining these and setting Y = y gives 

PA' ^ gA'\Y=y = Tr^ {gY=y,A'\APA) * Pyty (195) 

In conventional notation, this translates into 



followed by the rule BC2: P{R') -^ P{R'\Y = y), 
which is an instance of Bayesian conditioning. It is use- 
ful to express both these steps in terms of the quanti- 
ties that are given in the problem, namely, the condi- 
tional P{Y, R'\R) and the prior over R, P{R). The condi- 
tional probability distribution P{R'\R) used in eq. (|190p 
is simply the marginal of P{Y,R'\R), i.e. P{R'\R) = 
^y P(F, i?'|i?). Meanwhile, the expression for the con- 
ditional is P{R'\Y) = P{R', Y)/P{Y), where P(P', Y) = 



PA' 



Pf 



£. 



A'\A 

y 



(pa) 



P{Y = y) ■ 



(196) 



Given the expression for pA' in eq. (|193|) . we see that 
QBC2 is simply a transition from your prior about the 
system output by the measurement, the result of ap- 
plying the non-selective update map, to your posterior 
about the system output by the measurement, the result 
of applying the selective update map and normalizing. 
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That this transition from non-selective to selective up- 
dates should be regarded as analogous to Bayesian con- 
ditioning has previously been argued by Ozawa [4^, [S^ . 
In fact, the rule QBC2 is a particular example of the 
kind of Bayesian conditioning considered in §V A II Ev- 
ery measurement with output region A' can be considered 
to define a preparation of A' for every outcome (assuming 
a fixed input state) . The set of states prepared is given 
by the components of gA'iYi which we can compute from 
the causal joint state Qya'a — Qya'\a * Pa by tracing 
over A and conditioning on Y . The rule QBC2 is then 
just Bayesian conditioning on F , thought of as a classical 
preparation variable. 



2. No Information Gain Without Disturbance 

We have argued that neither the classical rule U nor 
the quantum state update rule QU are instances of 
Bayesian conditioning. However, a skeptic might counter 
that our argument is an artifact of our insistence that the 
system before and after the measurement should be given 
different labels. If the conditional distribution P{R'\R) 
in the belief propagation rule of eq. (|190|) has the form 



P{R' ^r'\R = R) 



(197) 



where 5^' ^r is the Kronecker-delta function, then R and R' 
are perfectly correlated and consequently P{R'\Y — y) 
has precisely the same functional form as P{R\Y — y). 
In this case, one could say that the rule U is effectively 
just Bayesian conditioning. 

If P{R'\R) is just a delta function, then we say that 
the measurement is non- disturbing. Recall that for every 
PiY\R) that characterizes the outcome probabilities for 
a measurement, there are many conditionals (i.e. classi- 
cal instruments) P(Y,R'\R) that might characterize its 
transformative aspect and are consistent with P{Y\R). 
It is not difhcult to see that, among all such condition- 
als, there is always one for which the measurement is 
non-disturbing, namely, 



P{Y, R' = r'\R = r) = P{Y\R = r)Sr 



(198) 



Of course, there are also many ways of implementing 
a measurement of P{Y\R) such that it is disturbing. 
Therefore, whilst the update rule U is not an instance 
of Bayesian conditioning for every possible implementa- 
tion of the measurement, there is always at least one 
implementation such that it is effectively just Bayesian 
conditioning. 

The obvious question to ask at this point is whether 
it is possible to implement a quantum measurement in 
a non-disturbing way, such that the associated quan- 
tum update rule QU (perhaps the projection postu- 
late, perhaps some other rule) is effectively just quan- 
tum Bayesian conditioning. For the measurement to be 
non-disturbing, the causal conditional state qa'IA in the 



belief propagation rule of eq. (|192p would have to be of 
the form 

i.e it would need to be the partial transpose of the (unnor- 
malized) maximally entangled state. This corresponds 
to perfect correlation between A and A' because it is 
Jamiolkowski-isomorphic to the identity channel. If the 
belief propagation step in the rule QU were of this form, 
then gA'\Y=y would have the same functional form as 
QA\Y=yj cind the overall quantum update rule QU would 
be effectively just Bayesian conditioning. 

Of course, we are only interested in the case where the 
measurement is nontrivial. If the measurement gives no 
information about A, then the posterior is the same as 
the prior and no real conditioning has occurred. There- 
fore, we restrict our attention to the case where some 
information is gained. In the language of conditional 
states, the only kind of measurement that yields no in- 
formation about the input state is one associated with 
a causal conditional state that factorizes, that is, one of 
the form gY\A = Py (recall that, in our notation there is 
an implicit ^Ia on the right hand side of this equation). 
In conventional notation, this corresponds to a POVM of 
the form {P{Y — y)/^}, which generates a random out- 
come Y ^ y from the distribution P{Y) regardless of the 
state of A. We are interested in nontrivial measurements 
for which gY\A does not factorize in this way. 

With these definitions in hand, the answer to our ques- 
tion is a resounding "no" — a quantum state update rule 
of the form QU can never be effectively just Bayesian 
conditioning because, unlike the classical case, in quan- 
tum theory information gain necessarily implies a distur- 
bance. This prevents any QU rule, such as the projec- 
tion postulate, from being pure Bayesian conditioning. 
Whilst this fact is well-known, it is instructive to prove 
it in the conditional states formalism. 

Theorem V.2 (No information gain without distur- 
bance). Consider a measurement described by an instru- 
ment associated with the causal conditional state Qya'IA- 
R is impossible for this measurement to be both informa- 
tive about A (qy\a 7^ Py) cmd non- disturbing (qa'\a = 
T.,,k\j){k\A®\k){3\A')- 

The proof is a causal analog of the monogamy of en- 
tanglement (see [3j for related ideas). 

Proof. The operator Qya'\a is the partial transpose over 
A of an acausal conditional state Pya'\a- Combining this 
with the maximally mixed state gives a valid tripartite 
acausal state via 



PYA'A = PYA'\A * lA/d = PYA'\A/d, 



(200) 



where d is the dimension of "Ha . The condition that the 
measurement be non-disturbing is equivalent to pa'IA = 
Si k \j) (^U ® b) (^Ia'' which imphes that the tripartite 
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state PyA'A should have a reduced state on A' A that is 
maximally entangled. Meanwhile, the condition that the 
measurement be informative is equivalent to py\a 7^ Py, 
which implies that the reduced state on YA of pyA'A 
should not be a product state. But, by the monogamy of 
entanglement, any tripartite state pvA'A for which pAA' 
is maximally entangled must have a product state for its 
reduced state pya- Hence, both conditions cannot be 
satisfied simultaneously. D 

We end with another, less obvious, disanalogy be- 
tween the quantum and classical cases that bears on the 
question of how to interpret the quantum collapse rule 
in our framework. In the previous section, we showed 
that the classical update rule U could be decomposed 
into belief propagation followed by Bayesian condition- 
ing. We could just as well have decomposed it in the 
opposite order: Bayesian conditioning followed by be- 
lief propagation. Specifically, we could first apply BCl: 
P{R) —T' P{R\Y — y), and then propagate the con- 
ditioned state via P{R\Y ^ y) -^ P[R'\Y == y) = 
Y.j,P{R'\R,Y = y)P{R\Y = y). 

It is natural to ask whether such a reverse-order de- 
composition is possible in the quantum case. That is, 
can QU: pA -^ QA'\Y=y be decomposed into QBCl: 
PA — >■ QA\Y=y, followed by belief propagation QA\Y=y ^ 
8A'\Y=y'^ Perhaps surprisingly, this cannot be done. The 
belief propagation would have to have the form gA'\Y=y — 
TvA {gA'\A,Y=yQA\Y=y)- But to Compute gA'\A.Y=y from 
QY=y,A'\A we need to move Y from the left of the con- 
ditional to the right while keeping A on the right. Clas- 
sical analogy suggests that this could be done using a 
conditionalized form of the quantum Bayes' theorem, i.e. 



QA'\A,Y=y — QY=y,A'\A* eY= 



-■y\A- 



Unfortunately, in the 



conditional states formalism, valid equations do not nec- 
essarily remain valid when we conditionalize each term 
(this is discussed further in WIip . In particular, the con- 
ditionalized form of the quantum Bayes' theorem is not 
valid. Not every intuition from classical Bayesian infer- 
ence carries over into the conditional states formalism. 



VI. RELATED WORK 

In this section, quantum conditional states are com- 
pared to other proposals for quantum generalizations of 
conditional probability and the conditional states formal- 
ism is compared to several recently proposed operational 
reformulations of quantum theory. 



A. Comparison to Other Quantum Generalizations 
of Conditional Probability 

Several quantum generalizations of conditional proba- 
bility have been proposed in the literature, so it is worth 
comparing their relative merits to the conditional state 
formalism developed here. 



Firstly, Cerf and Adami have proposed an alternative 
definition of an acausal conditional state |17Hl9j (their 
definition does not extend to the causal case). For a 
bipartite state pab, the Cerf- Adami conditional state is 



J~) 



exp {log PAB - log PA <8) Ib) 



(201) 



This proposal has a close connection to the calculus of 
quantum entropies, since the conditional von Neumann 
entropy of a state pab can be succinctly written as 



S iB\ A) ^~Ttab[pab log p'^° 



CJO) 

A 



(202) 



which is analogous to the classical formula for the condi- 
tional Shannon entropy 



H{S\R) ^-J2 Pi^: S) log P{S\R). 



(203) 



R,S 



Similar compact formulas hold for other information the- 
oretic quantities, such as the quantum mutual informa- 
tion and conditional mutual information. 

In [53l | , Leifer and Poulin introduced a family of condi- 
tional states, again restricted to the acausal case, indexed 
by a positive integer n and given by 



Jn) 



(pab*Pa") 



(204) 



This unifies the Cerf-Adami conditional state with the 
definition used in the present work in the sense that 

Pb\a - Pb\a ^^°^ Pb\a — iiin«^oo Pb\a- 

The main concern of [53| was the generalization of 
graphical models and belief propagation algorithms to 
quantum theory, and their use in simulating many-body 
quantum systems and decoding quantum error correction 
codes. In this context, the n = 1 and n — > cxd cases are 
particularly interesting. The n — >■ oo case is the natu- 
ral one to use for simulating many-body systems and it 
allows for a simple generalization of one direction of the 
classical Hammersley-Clifford theorem, which character- 
izes the states on Markov Networks. On the other hand, 
the n = 1 case is more useful for decoding quantum error 
correction codes and, as outlined in the present paper, 
it extends to the causal case and has close connections 
to quantum preparations, measurements and dynamics 
that are lacking for other values of n. Given that dif- 
ferent applications work better with different definitions 
of the conditional state, it is probably fair to say that 
there is no uniquely compelling quantum generalization 
of conditional probability. 

With this in mind, note that Coecke and Spekkens have 
outlined a broad framework for generalizations of condi- 
tional probability within the category theoretic approach 
to quantum theory [5J]. Encouragingly, it is possible to 
derive generalizations of Bayes' theorem and condition- 
ing abstractly within this framework, but it is not yet 
clear what axioms within this framework are sufficient to 
capture all the important aspects of conditioning. 
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The final quantum generalization of conditional prob- 
ability to be considered here is the quantum conditional 
expectation (see [13| for the original paper and [ij- 
|l6 !| for reviews). This was proposed in the context of 
quantum probability theory, which is a non-commutative 
generalization of classical measure-theoretic probability 
within the framework of operator algebras. As such, it is 
well-defined for systems with infinite dimensional Hilbert 
spaces as well as for systems with an infinite number of 
degrees of freedom, for which there is more than one uni- 
tarily inequivalent Hilbert space representation. Also, 
it describes conditioning on an arbitrary algebra of ob- 
servables, rather than just on a tensor factor as has been 
considered here. Importantly for the present work, Redei 
has proposed an argument based on quantum conditional 
expectations purporting to show that quantum theory 
cannot be understood as a theory of Bayesian inference 
|55| (see also [.56]). 

However, quantum conditional expectations have a 
major flaw that is not shared by the conditional states 
formalism presented here. Fortunately, the full operator- 
algebraic machinery is not necessary to make the point; 
the case of finite dimensional Hilbert spaces and condi- 
tioning on a tensor factor suffices. Further details of the 
general case and how the formalism used below follows 
from it can be found in 14]. 

Classically, for a pair of random variables, R and S, a 
conditional expectation of S given R is a, positive map 
^R\s,R from functions of R and S to functions of R that 
satisfies 



$ 



R\R,S 



ifiR)) = f{R) 



(205) 



for all functions f{R) that are independent of S. Any 
such map is explicitly given by 



'^R\R,sifiR^ S)) = E ^(^l^)/(^' S), 



(206) 



where P(S\R) is a conditional probability distribution. 

Starting from a joint state P{R, S), one can obtain a 
conditional expectation by plugging the associated con- 
ditional probability P{S\R) into eq. (I206p . The main 
point of this is that it allows the expectation value of 
any function f{R, S) to be computed from the marginal 
probability distribution P{R) via 

Yl "^RlR^sUiR, S))P{R) - J2 /(^' ^)^(^' ^y (207) 



R 



R.S 



The set of functions on R and S can be thought of as 
the dual space to the set of probability distributions on 
R and S, where the linear functional / associated with 
f{R, S) is given by 



f{PiR,S))=J2P(.R,S)fiR,S), 



(208) 



R,S 



i.e. it is the functional that maps the probability distri- 
bution P{R, S) to the expectation value of f{R, S) with 



respect to P{R,S). With respect to this identification, 
a conditional expectation ^mpj^s h^s a dual map £r^s\r 
that maps the space of probability distributions over R 
to the space of probability distributions over R and S. 
This is given by 



£R^s\RiP{R)) = P{S\R)PiR) 



(209) 



and is called a state extension because every probabil- 
ity distribution P{R) gets mapped to a valid probability 
distribution P{R^ S) = P{S\R)P{R) on a larger space. 
In addition, state extensions that are dual to conditional 
expectations satisfy 



Y^£r,s\r{Q{R))^Q{R) 



(210) 



for every input distribution Q{R). 

In the finite dimensional, tensor factor case, the quan- 
tum conditional expectation of B given A is a completely 
positive [sol linear map ^a\ab '■ -2 CHab) — > -2 CHa) that 
acts on the set of observables on Hab and satisfies 



^A\AB (Ma «) Ib) 
for all operators Ma G £ {T~1-a 



Ma 



(211) 



This is analogous 
to the condition given in eq. (|205p . The dual map 
£ab\a ■ ^ C^a) — >■ £ CHab) acts on states and is a state 
extension, which means that 



TAB — £_ 



ABIA 



(ta) 



(212) 



is a valid state for any state ta on Ha- In addition, the 
state extensions that are dual to conditional expectations 
satisfy 



Trs {£ab\a (ta)) = TA 



(213) 



for every state ta on Ha, which is analogous to eq. (I210p . 
As in the classical case, one would like to associate 
every joint state pab with a conditional expectation, such 
that the dual state extension £ab\a satisfies 



£ 



AB\A 



(pa) = PAB, 



(214) 



i.e. it should give back the state that you started with 
when you input the reduced state. The analogous re- 
quirement is a key property of classical conditional prob- 
ability as it is what allows an arbitrary joint state to be 
broken up into a marginal and a conditional that are inde- 
pendent of one another. Unfortunately, the fact that eq. 
(I213p holds for every input state means that this require- 
ment can only be met for product states, i.e. states of the 
form Pab — Pa® Pb- This severely restricts the applica- 
bility of quantum conditional expectations for describing 
the correlations present in quantum states. Indeed, they 
are incapable of representing any correlations at all. This 
fact is known in the quantum probability literature (see 
|14| example 9.6), but here is an elementary proof. 

Ironically, the easiest way to show that pab has to be 
a product state is to use the conditional states formalism 
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as outlined in this paper. The state extension £ab\a 
is Jamioikowski isomorphic to a causal conditional state 
Qab\A' , where A' has the same Hilbert space as A and the 
' is just used to distinguish the input and output spaces. 
Then, eq. (|214l) can be rewritten as 



PAB = Tr^' {qabia'PA' 



(215) 



where pA' is the same state as pA- Similarly, eq. (|213p 
can be rewritten as 



TA = Tr^'s {qabia'TA' 
= Tr^' (qaia'Ta') , 



(216) 
(217) 



for all TA, where ta' is the same state as ta and gA\A' = 
Tr_B (qabia') ■ Since, eq. (j217p has to hold for every input 
state, gA\A' has to be Jamioikowski isomorphic to the 

identity superoperator, so gA\A' = l^"*") (^"""UfA" Since 
this is pure, monogamy of entanglement |8l| entails that 
Qab\A' must be of the form gAB\A' = l*^> (*^Uf^/®Afs 
for some operator Mb on Hb- Substituting this into eq. 
(Pr^ gives 



PAB ^ PA® Me 



(218) 



which shows that pab must be a product state and 
Mb = Pb = Tr^ {pab)- Therefore, conditional expecta- 
tions associated with joint states only exist for product 
states PAB = PA® Pb- 

It is unclear why quantum probabilists have not re- 
garded this as a fatal flaw in their definition of quantum 
conditional expectation. However, despite this problem, 
quantum conditional expectations are still worthy objects 
of study as they come up in a variety of contexts. For 
example, the projection onto the fixed point set of a com- 
pletely positive map is a quantum conditional expecta- 
tion. The point is just that the terminology "quantum 
conditional expectation" is an inaccurate way of describ- 
ing the way that these maps relate to quantum states. 

For comparison, the acausal conditional state defined 
in the present work can also be described as a map 
^AB\A ■ ^('Ha) — > ^iV-AB), similar to a state exten- 
sion, with the crucial difference that J^abia is not linear. 
For a conditional state Pb\a, the map Tab\a is defined 
as 



J^AB\A (pa) = PaPb\aPa- 



(219) 



The nonlinearity allows this map to satisfy eqs. (|213l) and 
()214|) without running into trouble with the monogamy 
of entanglement. Crucially though, because this map is 
nonlinear, it does not have a dual map that could be re- 
garded as a conditional expectation. Therefore, the close 
connection between conditional probabilities and condi- 
tional expectations breaks down within this formalism. 

Redei's argument against a Bayesian interpretation of 
quantum probabilities is based on a variant of Jeffrey 
conditioning with respect to a quantum conditional ex- 
pectation. In the present context, if the state of a region 



A is updated from pA to p^^ , then Redei asserts that the 
state for AB should be updated to p^g = £ab\a {pa"^^) j 
where £ab\a is a state extension derived from the prior 
state Pab- He then shows that this update rule fails to 
satisfy an important stability criterion in the infinite di- 
mensional case. Since his argument is crucially based on 
the linearity of conditional expectations and their duality 
with state extensions, it does not apply to the nonlinear 
maps J-^AB\A associated with quantum conditional states. 
However, since the failure only occurs for infinite dimen- 
sional systems, a full refutation will have to wait until the 
conditional states formalism has been extended beyond 
the finite dimensional case treated here. 

To reiterate, it seems unlikely that there is a unique 
quantum generalization of conditional probability that 
has properties analogous to every single property of clas- 
sical conditional probability that is traditionally regarded 
as important. For this reason, it is important to keep ap- 
plications in mind when defining quantum conditionals, 
rather than working in a formal mathematical vacuum. 



B. Operational Formalisms for Quantum Theory 

Recent efforts to replace the conventional formalism 
of quantum theory with a new operational formulation 
— typically in an effort to provide an axiomatic deriva- 
tion of quantum theory — have much in common with 
the work presented here. Particularly cognate to our ap- 
proach is the work of Hardy |57H59| . the Pavia group 
60, 61], Oreshkov et al. [64I and Coecke's group [631 ;64]. 

The reformulation of quantum theory presented by the 
Pavia group makes heavy use of the Choi isomorphism 
between quantum operations and bipartite states, and 
leverages this to represent quantum operations by oper- 
ators rather than maps. Mathematically, this is also how 
we achieve a unification of the treatment of acausally 
related and causally related regions. In particular, our 
proposition lIII.71 which specifies how to represent a quan- 
tum channel in terms of conditional states, is the counter- 
part to the expression of the action of a quantum chan- 
nel in terms of the link product. Note that the Pavia 
group uses the Choi isomorphism whereas we use the 
Jamiolkowski-isomorphism. The latter has the advan- 
tage of being basis-independent, so that the partial trans- 
poses that appear in the link product are absent in our 
approach. 

Hardy's latest work on reformulating quantum the- 
ory, using the duotens or framework [59|, also represents 
quantum operations as operators in much the same way 
as is done in the quantum combs framework and our own. 
Furthermore, Hardy's notion of circuit trace (an exam- 
ple of the causaloid product introduced in _57, 59]) pro- 
vides a unified way of representing a composition of maps 
as well as tensor products of system states, which is to 
say, a unified way of representing correlations between 
acausally related and causally related regions. The moti- 
vation for Hardy's work on the causaloid product is very 
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similar to the motivation for our own, namely, to formu- 
late quantum theory in a manner that is even handed 
with regard to possible causal structures. 

Recent work by Oreshkov, Costa and Brukner [64I 
also represents correlations between acausally related and 
causally related regions is a uniform manner by appealing 
to the Choi isomorphism. 

One notable way in which we depart from all of these 
approaches is that we treat classical systems internally 
to the formalism, on a par with quantum systems, rather 
than as indices on operators representing preparations 
and measurements. 

Finally, we compare our approach to the categorical 
approach of Coecke, where much of quantum theory (in 
particular the non-metrical parts) is reformulated using 
the mathematical framework of symmetric monoidal cat- 
egories and the graphical calculus that can be defined for 
these [63, ISJI • Systems are the objects of the category, 
while quantum states and quantum operations are the 
morphisms. The isomorphism between bipartite states 
and operations also features prominently in this approach 
and arises from having a compact structure in the cate- 
gory. Furthermore, classical systems can be treated in- 
ternally within this framework. It should also be noted 
that although Coecke's categorical framework is typically 
used to formulate quantum theory as a theory of physical 
processes, it can also be used to express our formulation 
of quantum theory as a theory of inference. For infer- 
ence among acausally related regions, this was done in 
54| . An extension of this work to the case of inference 
among causally related regions should be instructive. 

The mathematics of all of these approaches and our 
own are quite analogous. It is in the interpretational as- 
pect that the greatest differences are to be found. In 
the reformulations considered above, quantum theory is 
given a rather minimalist interpretation - it is viewed as a 
framework for making predictions about the outcomes of 
certain measurements given certain preparations. Quan- 
tum states — however they are reformulated mathemat- 
ically — are taken to be representations of preparation 
procedures, while quantum operations are taken to be 
representations of transformation procedures. These ap- 
proaches follow the interpretational tradition of opera- 
tionalism. By contrast, our work takes quantum states to 
represent the beliefs of an agent about a spatio-temporal 
region and takes quantum operations to represent be- 
lief propagation; it has an epistemological flavor rather 
than an operational one. For instance, the notions that 
we deem to be most promising for making sense of the 
quantum formalism are those one finds in textbooks on 
statistics and inductive inference, such as Bayes' theo- 
rem, conditional probabilities, statistical independence, 
conditional independence, and sufficient statistics and 
not the notions that are common to the operational ap- 
proaches, such as measurements, transformations and 
preparations. In this sense, our approach is more closely 
aligned in its philosophical starting point with quantum 
Bayesianism, the view developed by Caves, Fuchs and 



Schack [2g |65|-|6S 

The particular merit of our epistemological approach 
is the strong analogy that it affords between quantum 
inference and classical probabilistic inference. It makes 
the conceptual content of various quantum expressions 
more transparent than they would otherwise be. It also 
bolsters the view that quantum states ought to be inter- 
preted as states of knowledge. 

Our take on how to incorporate causal assumptions 
into quantum theory also has a rather different starting 
point than the works described above. The relation we 
posit between the notions of causation and correlation 
is most informed by the work on causal networks (also 
known as Bayesian networks), summarized in the text- 
books of Pearl [63 and of Glymour, Scheines and Spirtes 
[7(J |. We ultimately hope to generalize this analysis of 
causality by replacing classical probability theory with 
quantum theory, understood as a theory of Bayesian in- 
ference [531 . If the goal is to develop a formalism for 
quantum theory that is causally neutral then we argue 
that the causal network approach holds an advantage 
over the operational approach. Specifically, the prob- 
lem with taking experimental operations as a primitive 
notion is that they already have a notion of causal struc- 
ture built into them insofar as the output of an operation 
is causally infiuenced by the input, but not vice-versa 
(similarly, if the operation is a measurement, then the 
outcome is causally influenced by the input but not vice- 
versa). On the other hand, elementary regions — the 
primitive notion of the causal network approach — are 
precisely the sorts of thing that can enter into arbitrary 
causal relations with one another, whilst not having any 
intrinsic causal structure themselves. 



VII. LIMITATIONS OF THE *-PRODUCT 

Using the ^-product makes the conditional states for- 
malism look very similar to classical probability theory. 
Often, by replacing probabilities with operators and ordi- 
nary products with ik-products, one can obtain an equa- 
tion that is valid in the conditional states formalism from 
one that is valid for classical conditional probabilities. 
However, because the •-product is nonassociative and 
noncommutative, this does not always happen. In this 
section, the limitations of the •-product and the disanalo- 
gies between classical conditional probabilities and quan- 
tum conditional states are discussed. 



A. Conditionalized Equations 

In classic al p robability theory, if one takes a univer- 
sally valid 183] equation relating conditional, joint and 
marginal probability distributions over a set of variables 
R, S, . . . and conditionalizes every term on a disjoint vari- 
able T, then the equation that results is still universally 
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valid. For example, the equation 

P{R,S) = P{S\R)P{R) (220) 

generalizes to 

PiR,S\T)^ P{S\R,T)P{R\T), (221) 

where a T has been placed on the right of the | in each 
term. 

Unfortunately, the analogous property does not hold 
in the conditional states formalism. For example, for 
acausal states the analog of eq. (I22ip would be 



Pab\c = Pb\ac * Pa\c- 

Writing this out explicitly, the left hand side is 

_i _i 

Pc^PABCPc'' ^ 

whereas the right hand side is 



Pc PAcPc PacPabcPac 



h[pc' 



PACPc' 



(222) 



(223) 



(224) 



Because the ^-product is noncommutative, the terms in- 
volving pac and p^^ cannot be brought together and 
made to cancel as they would in the classical case. There- 
fore, counterexamples to this rule can occur when pAc 
and Pc do not commute. For example, it is straightfor- 
ward to verify that a generalized W-state of the form 



IV-) 



ABC 



(1001) 



ABC 



1010) 



ABCl 



V2 



1 100) 



ABC 



(225) 



does not satisfy this rule. The calculation is not espe- 
cially instructive, so it is omitted. 

Note that a universally valid equation relating quan- 
tum conditional, joint and marginal states is still univer- 
sally valid if one conditionalizes every term on a classical 
variable. This follows from the equality of the expres- 
sions for the left hand and right hand sides of eq. (|222p 
when G is replaced by a classical variable T. 



B. Limitations of Causal Joint States 



In ijIII E[ a causal joint state was defined as an operator 
of the form Qb\a'^Pa- This representation of two causally 
related regions highlights the symmetry with the acausal 
case, since, up to a partial transpose, it is the same sort 
of operator that would be used to represent two acausally 
related regions. Based on this, the quantum Bayes' the- 
orem was developed in a way that is formally equivalent 
for acausally and causally related regions. For this to 
work, we only needed causal joint states for two causally 
related regions. However, since acausal states are not 
limited to just two regions, it is natural to ask whether 



causal joint states can be defined for more than two re- 
gions. Unfortunately, the naive generalization does not 
work for scenarios with mixed causal structure, e.g. two 
causally related regions that are both acausally related 
to a third regions, and it also does not work for multiple 
time-steps. 

In the remainder of this section, these limitations are 
discussed and a different definition of a causal joint state 
is suggested, which works more generally, but does not 
exhibit the symmetry between the acausal and causal 
scenarios for two regions. 



1. Mixed Causal Scenarios 

If region B is in the causal future of region A, then 
they can be assigned a causal joint state qab — Qb\A'*'^Pa- 
The causal conditional state is Jamiolkowski isomorphic 
to the dynamical CPT map £b\a and pA is the input 
state. Now suppose that there is a third region, C, that 
is acausally related to both A and S, i.e. we start with a 
joint state pAc of A and C and apply £b\a to region A, 
whilst doing nothing to C, to obtain B and C in a joint 

state pBC = {£b\a®1-c) (pac)- 

One might think that a joint state of ABC could be 
defined via qabc = Pb\A'*' Pac- We would like this state 
to have the correct input and output states as marginals, 
so, in particular, it should satisfy 



PBC = £b\a (pac) = Tr^ (qabc) 



(226) 



Unfortunately, this fails because theorem IIII.4I implies 
that 



whereas 



PBC = Tr^ (qbiaPac) , (227) 



Tr^ (qabc) = Tr^ {qb\a * Pac) ■ (228) 



Because there is no trace over C, the cyclic property of 
the trace cannot be used to equate these two expressions. 
In fact, in addition to not having pBc as a reduced 
state, qabc fails to correctly represent the correlations 
between the causally related regions A and B as well, i.e. 

Trc {qabc) 7^ 6AB- 

To see the failure of both these conditions explicitly, 
consider an example in which Hai T~Lb and "He are all 
of dimension d, the input state pAc — 2 \^^) ('^^Iac 
is a maximally entangled state, and £b\a is the iden- 
tity superoperator. Then, the output state is pbc = 
1 1$+) ($+|gp, and the causal joint state is qab — 

In this case, explicitly calculating the operator qabc — 
Qb\a*Pac gives 



Qabc = 3 I* 



-)($^ 



lAC 



d ' 



(229) 



Whilst the reduced state pAC gives the correct input 
state, Pbc is not the output state and qab is not the 
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causal joint state. In fact, no operator of the form 
Qabc = ?s|A * Pac can ever satisfy all three condi- 

tions PAC = ^ 1$+) (*+Uc: SAB = i 1$+) ($+|^B and 
Pbc ~ 2 \^^) {^^Ibc simultaneously. This is because 
QABC is a locally positive operator, and hence it must 
satisfy the monogamy of entanglement, but requiring all 
three bipartite reduced states to be maximally entangled 
would violate monogamy and, in fact, only one of the 
three conditions can be satisfied. Because of this, we 
have to move beyond locally positive operators in order 
to faithfully represent all the correlations. 

The validity of eq. (|227p suggests that using an or- 
dinary product instead of a ^-product could be a better 
way of defining a joint state in this scenario. Indeed, 
the operator gB\APAC does have all the correct bipartite 
marginals. For example, in our example of maximally 
entangled input and identity map dynamics, explicit cal- 
culation gives 

1 '^ 
0b\aPac = -^ Y^ \j){k\^(g)\m){j\g(E)\m){k\(j. 

j,k,m—l 

(230) 
This violates the monogamy of entanglement, which is 
allowed because it is not a locally positive operator. In 
fact, it is not even Hermitian. 

Whilst this may turn out to be a useful representa- 
tion, it has a number of disadvantages compared to the 
•-product. First of all, it is non-unique because the op- 
erator pacQb\a gives an equally good account of all the 
correlations. One could even take combinations of the 
two operators, such as 



dependent given B, can always be decomposed via the 
chain rule into 



1 



(Qb\aPac + PacQb\a) 



(231) 



which might be useful because it is Hermitian. 

Secondly, given an arbitrary operator Mab, it is not 
clear how to check whether it is of the form Qb\aPa with- 
out running over all possible input states pA and checking 
whether MabP^ is a valid causal conditional state. In 
contrast, when using the ^-product, we know that an op- 
erator is of the form Qb\a^PA iff it is the partial transpose 
of a valid acausal state on AB^ which is a straightforward 
condition to check. 

Finally, a related issue is that, when using the ordi- 
nary product instead of the Tkr-product, the set of possible 
causal joint states depends on the causal direction. For 
an evolution from A io B, a causal joint state would be 
of the form Qb\aPAi but for an evolution from i? to A it 
would be of the form qa\bPb- These define two different 
sets of operators, so we lose the symmetry that was used 
to obtain Bayes' theorem in the causal case. 



2. Multiple Time-Steps 

For three space-like separated regions, a Markovian 
joint state pabc^ where A and C are conditionally in- 



Pabc = Pc\B * {pb\a * pa) 



(232) 



For three time-like separated regions, the analog of 
Markovianity is a two time-step dynamics where the first 
CPT map is £b\a ■ £(Ha) -> £(^b) and the second is 
£c\b ■ -2 ("Hb) — ^ £ CHc), i-e. it has no direct dependence 
on ^. It is natural to ask whether this situation can be 
represented by a tripartite causal joint state that can be 
decomposed in a manner similar to eq. (j232p . 

Suppose that the causal conditional states isomorphic 
to £b\a and £c\B arc gB\A and gc\B- If the input state 
is Pa then the first time-step is represented as 



Pb = £b\a (pa) 


(233) 


= Tr^ {qb\aPa) 


(234) 


and the second time-step is represented as 




PC = £c\B (pb) 


(235) 


= Trs {qc\bPb) ■ 


(236) 


It follows that 




PC = £c\B °£b\a{pa) 


(237) 


= Trs (pcisTrA {qb\aPa)) , 


(238) 



The question is whether the complete dynamics might 
also be representable as 



PC = TrAs (6'c|s*(6's|a*Pa)) 



(239) 



which in turn would suggest that qabc — Qc\b * 
(qbia'*^ Pa) rnight be a good candidate for a tripartite 
causal joint state. 

This fails because the expression qabc — Qc\b * 
\Qb\A'*^ Pa) is not well defined. To see this, expand the 
first 7k-product to obtain 



qabc = yjQB\A * PAQc\ByjQB\A * PA- 



(240) 



The term y/QB\A * PA is not well defined because Qb\a'*'Pa 
is not a positive operator, but only locally positive, so it 
may have negative eigenvalues. Hence, it does not have 
a unique square root. This could be remedied by adopt- 
ing a convention for square roots of Hermitian operators, 
such as demanding that the square root of each negative 
eigenvalue has positive imaginary part. The resulting 
tripartite operator qabc would then have the correct re- 
duced states. Pa, Pb and pc, representing the state of 
the system at each time-step. However, it is not related 
to a tripartite state of three acausally related regions via 
partial transposes, so the symmetry that motivates the 
use of the •-product representation is lost. This loss of 
symmetry resonates with previous work showing that tri- 
partite "entanglement in time" is not isomorphic to or- 
dinary tripartite entanglement [2|. 
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As with the evolution of a subsystem, these problems 
can be remedied by using the ordinary product instead 
of the ^-product to represent time evolutions, but it is 
subject to the same disadvantages that were discussed in 
that context. 



VIII. OPEN QUESTIONS 

A. The Quantum Conditionals Problem 

Monogamy of entanglement is a key feature that dis- 
tinguishes classical from quantum information. There is 
a closely related difference between classical conditional 
probability distributions and acausal quantum condi- 
tional states that deserves further investigation. Clas- 
sically, if P{R) is a probability distribution and P{S\R), 
P{T\R,S) are conditional probability distributions then 



P{R, S, T) = P{T\R, S)P{S\R)P{R) 



(241) 



is always a valid probability distribution. Furthermore, 
the distribution so defined has the correct marginal and 
conditional states in the sense that 



Y,P{R,S,T) = P{R) 

S,T 

j:TPiR,S,T) 



Es.TPiR,S,T) 
P{R,S,T) 



- PiS\R) 
-P(T|i?,5), 



(242) 

(243) 
(244) 



whenever the left hand sides are well defined. 

In the quantum case, the analogous properties do not 
hold. Although a tripartite state pabc can always be 
decomposed as 



PABC = PC\AB * PB\A * PA 



(245) 



via the chain rule, one cannot start with an arbitrary 
reduced state pA and two arbitrary conditional states, 
Pb\a and Pc\AB^ and expect there to be a joint state 
PABC that has these conditional and reduced states. 

For example, suppose that B and C are conditionally 
independent of A, i.e. Pc\AB — Pc\A- Now, suppose 
that Pa is chosen to have more than one nonzero eigen- 
value and Pb\a is chosen to be maximally entangled, e.g. 
Pb\a = 1*^^) {^^\b\a- "^^i^ implies that the reduced 
state pAB — Pb\a*Pa is pure and entangled. If, in addi- 
tion, pc\A is chosen to be pc\A ~ \^^) {^~^\c\A^ ^^'^^^ ^^^ 
reduced state pAc = Pc\a~^Pa should also be pure and en- 
tangled. However, monogamy of entanglement says that 
this is impossible, so these choices of conditional state are 
not compatible. Determining the full set of constraints 
on coexistent conditional states for three acausally re- 
lated regions would be an interesting problem, as would 
determining the computational complexity of the n-party 
generalization. 



IX. CONCLUSIONS 

The formalism of quantum conditional states presented 
in this paper provides a step towards a formalism for 
quantum theory that is independent of causal structure, 
as a theory of probabilistic inference ought to be, and 
provides a closer analogy between quantum theory and 
classical probability theory. There is significant potential 
to use these results to simplify and generalize existing 
approaches to problems in quantum information theory. 
As an example of this, in a companion paper [7l| we pro- 
vide an approach to the problems of compatibility and 
pooling of quantum states that is based on a principled 
application of Bayesian conditioning and is a direct gen- 
eralization of existing approaches to the classical versions 
of these problems. It seems unlikely that the possibility 
of this approach would have been noticed within the con- 
ventional quantum formalism. However, this is only the 
beginning and we anticipate applications to quantum es- 
timation theory and to quantum cryptography, such as 
studying the relationship between cryptography proto- 
cols that employ different causal arrangements to achieve 
the same task. As it stands, the formalism is limited to 
two disjoint elementary quantum regions and the most 
pressing problem is to generalize it to arbitrary causal 
scenarios. This is a topic of ongoing work. 
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Appendix A: Proofs of Theorems 



Theorem IIII.4I (Jamiolkowski Isomorphism). Let 
2;b|/i : £,{'Ha) -^ ^{'Hb) be a linear map and let 
Mac G 'C('Hyic); where "He *s a Hilbert space of arbi- 
trary dimension. Then, the action of ^b\A on £,{'Ha) 
(tensored with the identity on £, [T-Lc)) is given by 



(€ 



B\A 



)Xc) [Mac] = T^ta {pb\aMac) 



(Al) 
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where Pb\a G ■£ (Hab) is given by 



PB\A = [<tB\A'®'U) I Y. !■?■) ^^\a ® \k) O'L 



(A2) 



Here, A' labels a second copy of A, Ia is the identity 
superoperator on £,{'Ha), and {\j)} is an orthonormal 
basis for Ha ■ 

Furthermore, the operator Pb\a is an acausal condi- 
tional state, i.e. it satisfies deftnition MIL 1[ if and only 
if (£b|a o Ta is CPT, where Ta : S.{Ha) ^ 2,{Ha) de- 
notes the linear map implementing the partial transpose 
relative to some basis. 



as follows. 



{£b\a®^c){Mac) = 

= X! ^^^\aa' Pb\A'Mac \kk) 



AA' 



J,fe 



Y.^^\A'PB\A'\k)^,{j\j,MAc\k), 



j.k 



51 01a Ps|a|A:)^(jU Mac |fc). 



],k 



Y.^^\APB\A\k) A{k\AMl^c\3) A 



],k 



Y.^3\APB\A\k) A{k\AMl''c\3) A 



j,k 



EO'IaPsiamJ^Ij) 



To prove this theorem, it is useful to make use of the 
connection between the Choi and the Jamiolkowski iso- 
morphisms. The Choi isomorphic map to an operator 
Pb|a is given by 



£b\a (pac) - (*+ \aa' Pbia'Mac \'^^)aa' ' (^3) 



where |*&~'')Ayi/ = X]i \JJ)aa' i^ ^ canonical maximally 
entangled state defined with respect to a preferred basis 
{\j)} for Ha- 

The operator is recovered from the map via 



PB|A^(£B|A'0lA)(|<i>+)(<f>+LA,) 



(A4) 



= TyA[pB\AM^^ 



{[^b\aoTa]®Ic){Mac) (A6) 



Proof of theorem |///.^[ Eq. (|A1|) is derived from eq. (jA2 
as follows: 

((£b|a (^Ic) (Mac) = ((£s|A' ® 2:c) (Ma'c) 



= {eB\A'®Ic 



Ei^)(^U' 



M 



'J-A'C 



Eij')0U' 



= {^B\A'®^c) J2 (^Ia' Ma'c \j)a' \k) Ul 



j,k 



= (CsiA' ®Ic){j2 (^\a Mac \j)a \k) 01. 



j,k 



= (€b|A' ®Ic) I Et^a (Ij) {kUMAc) \k) {j\a, 



Because 



E 1^') (^Ia ® 1^) OIa' - (|*+) (*+Iaa') ' (A5) 

J,k 



eqs. (|A4|1 and (|A2p differ only by whether one uses the 
projector onto the maximally entangled state (Choi) or 
the partial transpose thereof (Jamiolkowski) and the two 
isomorphic maps to p^i^ are related by €b\a — ^b\a°Ta, 
where Ta is the partial transpose operation with respect 
to the basis used to define the Choi isomorphism. 

The equivalence of eq. (jA3[) and eq. (jAll) is established 



^Tta I {eB\A'^Ic) 



Eo)(^u®ifc)oiA' \mac 

j,k 

= TrA (pbiaMac) (A7) 



Now suppose that Pb\a is an acausal conditional state, 
i.e. it is positive and Tr^ (pbia) — Ia- To show that the 
Jamiolkowski isomorphic map composed with a partial 
transpose ^b\a ° Ta is trace preserving, note that Ta is 
trace preserving, so it suffices to show that ^b\a is trace 
preserving. This proceeds as follows: 

Trs i^BiA (Ma)) - Trs (Tr^ {pb\aMa)) (A8) 

- Tr^ (Trs {pb\a) Ma) (A9) 

= Tr^ (IaMa) (AlO) 

= Tr^ (Ma) . (All) 
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To show that ^b\a°Ta is completely positive, note that it 
is equal to the Choi isomorphic map £b\Ai so it sufBccs to 
show that the latter is completely positive. By definition 

£-S|^(g)Zc (PAC) = (*+L^, PB\A'<»PAC \^^)jiA' (^l^) 

and this is a positive operator for arbitrary positive pAC, 
where Jic can have any dimension. 

Conversely, suppose <Bb\a ° Ta is CPT. Then, €b\a is 
also trace preserving, so 



Trs {pb\a) =T^b\J2 \j) (^U ® ^B\A m {jU^] 

\3:k 

(A13) 

= ^|j)(fc|^TrB(€B|A(|fc)(jUO) (A14) 

j,k 

= Elj)(^U®Tr^'(|fc)0'U') (A15) 

= T.\-j)(^\Ahk (A16) 

j,k 

= EIj)01a (A17) 

3 

= I A. (A18) 



Also, 



PB\A ^ {Sb\A' ®:Ia) {\^+) {^^ 



\AA' 



(A19) 



and this is a CPT map acting on a positive operator, so 
Pb\a is positive. D 



Theorem nil, ill Let (Ja\x be a hybrid operator, so that 
by eq. |^0| ^ it can be written as 



0-A|X ^^^Px^l^) {x\ 



X ' 



(A20) 



for some set of operators {p^}- Then, (Ta\x satisfies the 
definition of both an acausal and a causal conditional 
state for A given X , iff each of the components p^ is 
a normalized state on Ha ■ 

Proof. Suppose a-A\x has the form of eq. (|A20|) for a set 
of normalized states {p^}- Then it is clearly positive and 
satisfies Tr^i {<yA\x) = Ix because Tr^ (p^) = 1 for every 
X. Therefore, aA\x is an acausal conditional state. On 
the other hand, ctaix is invariant under partial transpose 
on X, so it is also a causal conditional state. 

Conversely, suppose that <Ja\x is an acausal con- 
ditional state. This means that it is positive and 
satisfies Tr^ {o'a\x) = Ix- Positivity means that 



AX '^A\x 



'AX 



> for all 



/AX 



the form of eq. (jA20|l then taking 

^A 



eUA 



X- 



I AX 



\iaA\x has 

\<i>)A®\^)A 

gives {(P\aP^ \(p) a ^ 0. By varying over all |(/))^ e "Ha, 
this implies that each p^ is a positive operator. To prove 
that these operators are normalized note that 



TrA {^A\x) 



;TrA(p^)|x)(:r| 



X 



(A21) 



This is an eigendecomposition of Tr^. {(yA\x) with eigen- 
values Tryi (p^) and if this is the identity operator then 
each of these eigenvalues must be 1. 

On the other hand, if a a\x is a causal conditional state, 
then its partial transpose over X must be positive and 

satisfy Tr^ ( Qaax ] — ^x- However, operators of the form 
of eq. (|A20p are invariant under partial transpose on AT, 
so the same argument applies. D 



Theorem nil. 13l Let Oy\a be a hybrid operator so that 
it can be written in the form 



<^Y\A^J2 \y^ ^y^Y ® E^ 



(A22) 



for some set of operators {E:^}. Then, ayiA satisfies 
the definition of both an acausal and a causal conditional 
state for Y given A iff the components E^ form a POVM 
on Ha, i-e. each Ey is positive and ^ Ey ~ La- 

Proof. Suppose crY\A has the form of eq. (|A22[) for a 

POVM {Ey}. Then it is clearly positive and satisfies 

Try (cry|A) = ^y^y = ^A- Therefore, aY\A is an 
acausal conditional state. On the other hand 



'y A 



\y) (vIy ® (E: 



Ta 



(A23) 



is also positive because the positive operators Ey re- 
main positive under the transpose. Also, Try I CTyf. j — 

Sy (Ey) ^ = Ia'^ ~ Ia- Therefore, ay^A is also a causal 
conditional state. 

Conversely, suppose that (Tyia is an acausal condi- 
tional state. This means that it is positive and satisfies 
Try (ctyia) = I A- By the same argument used in the 
proof of theorem IIII.lll positivity implies that, if aY\A 
is of the form of eq. (|A22[) , then each of the components 
Ey must be positive. Since Try [cfy^a) = '^y^y, the 
components must form a POVM. 

On the other hand, if <Jy\a is an acausal conditional 
state then, by the argument just given, its partial trans- 
pose over A must be of the form of eq. (jA22p for some 
POVM {Ey}. This means that aY\A itself can be written 
as 



^y\A ^^\v) (2/I1 



{E^f' 



(A24) 



but since the operators {Ey^ "* form a POVM whenever 
{Ey} is a POVM, aY\A is of the required form. D 



Theorem IIII.151 Let £b\a, £c\b o-''^d £c\A be lin- 
ear maps such that £c\A — £c\B ° ^■b\a- Then, the 
Jamiolkowski isomorphic operators, gB\A, Qc\B <md Qqia 
satisfy 



gc\A = TrB {gciBQBiA) 



(A25) 
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Conversely, if three operators satisfy eq. (jA25[) . then the 
Jamiolkowski isomorphic maps satisfy £c\A = ^CIB ° 
£b\a- 

Proof. By definition, the Jamiolkowski isomorphic oper- 
ator to £c\A is 



ec\A = {£c\A'®Ia) J2 I^') <^Ia ® 1^) 01a' (A26) 
= J2 \J) (fc| ® [^c|B ° £b\A' i\k) (jU,)] (A27) 

3,k 



Applying theorem llll. 61 to £b\A' gives 
Qc\a = 



E 1^') (^U ® ^c|B (Tr^' (to|A' |fc) (jU,)) 

J,k 

= E !■?■) (^Ia ® ^c|B ((jU- esiA' \k)A.) , (A28) 

and then applying the same theorem to £c\b gives 

Qc\A = 

E IJ) ('^Ia ® O'U' Trs (ecistolA') |fc>A, . (A29) 

3,k 



Since A' is a dummy label in this expression, it can be 
changed to A and then 



0c\A = E 1-^') ^-^'Ia Trs (ecistelA) \k) {k\^ (A30) 
= Trs (ec|sto|A) • (A31) 



For the converse direction, we have 

£c\A (Ma) = Tr^ {qc\aMa) 

= Tta (Trs {gc\BQB\A) Ma) 
= Trs (gcisTr^ (pbia^-'^a)) 
= Trs (fiicis^BlA (^-^a)) 
= fc|B {£b\a (Ma)) 
= fc|B O'^BIA {Ma) ■ 



(A32) 
(ASS) 
(AS4) 
(ASS) 
(AS6) 
(AS7) 

D 
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Note that the term "belief propagation" has also been 
used to describe message-passing algorithms for perform- 
ing inference on Bayesian networks. This is not the in- 
tended meaning here. 

Although it is not required here, one can be more pre- 
cise about this distinction as follows. A causal structure 
for a set of quantum regions is represented by a directed 
acyclic graph wherein the nodes are the regions and the 
directed edges are relations of causal dependence (the re- 
striction to acyclic graphs prohibits causal loops). Two 
regions are said to be causally related if for all paths con- 
necting one to the other in the graph, every edge along 
the path is directed in the same sense. Two systems are 
said to be acausally related if for all paths connecting one 
to the other, not every edge along the path is directed in 
the same sense. When there exist both sorts of paths be- 
tween a pair of nodes, the associated regions are neither 
purely causally nor purely acausally related. We do not 
consider this case in the article. 

There does exist a classical analog of this dependence on 
causal structure for pairs of regions, but it requires con- 
sidering the case of a classical theory with an epistemic 
restriction [73]. We do not pursue the analogy here. 
If \tp)BiA derives from a joint pure state \tp)^gvia eq. (|12p 
then this only implies that dim(supp(pA)) < dim('Hs), 
which is always true because the ranks of pA and ps are 
equal. 

Even if we do not adopt the convention of evaluating 
partial transposes in the preferred basis, the sets of 
acausal and causal conditional states are still isomorphic. 
If { \x)j^ } is the preferred basis for acausal states then this 
amounts to choosing a different preferred basis {|a;*)x} 
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for causal states, where * is complex conjugation in the 
basis used to define the partial transpose. However, this is 
an unnecessary complication that is avoided by adopting 
the recommended convention. [82] 

[79] Strictly speaking, this is only a special case of Bayesian 
conditioning, which can be formulated more generally for 
arbitrary events on a sample space rather than just for 
random variables. We restrict attention to the special 
case of conditioning one random variable upon another 
for ease of comparison with quantum theory. 

[80] The definition only calls for positivity, but it is a theorem [83] 
that all quantum conditional expectations are completely 
positive (see [14||'). 

[81] The monogamy of entanglement is well known for pos- 
itive operators. The fact that it also applies to locally 
positive operators follows one of the results of 73] , which 



shows that monogamy applies to more general probabilis- 
tic theories, including one in which states are locally pos- 
itive operators. 

Unlike the quantum Bayesians, however, we are not com- 
mitted to the notion that the beliefs represented by quan- 
tum states concern the outcomes of future experiments. 
Rather, the picture we have in mind is of the quantum 
state for a region representing beliefs about the physical 
state of the region, even though we do not yet have a 
model to propose for the underlying physical states. 
Universal validity means that the equation holds for 
the joint, marginal, and conditional probability distri- 
butions derived from any joint probability distribution 
P{R,S,...), rather than holding only in special cases, 
e.g. due to symmetries or degeneracies of a particular 
distribution. 



